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DX AGNOSTIC REAGENTS RELATING TO DIABETES 

The present invention relates to an autoantigen 
associated with type I diabetes (Insulin Dependent Diabetes 
Mellitus (IDDM) ) . This autoantigen is useful in the 
5 diagnosis of IDDM. 

IDDM is an autoimmune disease. It is caused by the 
autoimmune system-mediated destruction of pancreatic islet 
beta-cells. This leads to an imbalance in the levels of 
the hormones insulin and glucagon which disrupts glucose 
10 metabolism, leading to the well-known symptoms of diabetes, 
such as high glucose levels in urine. As IDDM is an 
autoimmune disease, autoantigens can often be detected in 
the blood of individuals suffering from IDDM. 

Some characterisation of the autoantigens involved in 
15 IDDM has already been achieved. In particular, two 

autoantigens are known to be closely associated with the 
development of IDDM; these are known as the 3 7 kDa and 
40kDa autoantigens, because of their respective molecular 
weights. It has recently been suggested that the 40kDa 
20 autoantigen is a protein known as IA-2 (Payton et al (1995) 
J. Clin. Invest. 96, 1506-1511). 

Human IA-2 (hIA-2) was originally cloned in a 
truncated form, and given the name ICA-512 (Islet Cell 
Antigen 512) (Rabin et al (1994) J. Immunol. 152, 3183). 
25 Subsequently, the complete DNA and amino acid sequences 
were determined and the protein was given the name IA-2 
(Lan et al (1994) DNA and Cell Biology vol 13, No 5, pp 
505-514) . 

A mouse homologue (mIA-2) is also known (Lu et al 
30 (1994) Biophys. Biochem. Res. Commun. vol 204, No 2). Its 
cDNA shows 92% identity to that of hIA-2. Two possible rat 
homologues have also been identified. The first of these 
is known as PTPLP (Kambayashi et al (1995) Biochem. J. 306, 
331-335) and has 79.3% identity to hIA-2 at the amino acid 
35 level. The second has a sequence identified from a series 
of overlapping clones (Passini et al (1995), Proc . Natl. 
Acad. Sci. USA 92, 9412-9416). This DNA shows 93% homology 
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to the corresponding mIA-2 sequence and 86% homology to the 
corresponding hIA-2 sequence. 

As stated above, it has been suggested that hi A- 2 is 
the 4 0kDA autoantigen associated with IDDM . 
5 Other putative IDDM-associated autoantigens include 

glutamic acid decarboxylase (GAD) and the islet cell 
autoantigens (ICA) known as ICA-12, ICA-13, ICA-208, ICA- 
302, ICA-313, ICA-123, ICA-525 and ICA-505 (EP-A-0, 383, 
129; EP-A-0, 569,800). 

10 Some of these IDDM-associated antoantigens have been 

proposed as diagnostic reagents for screening patients for 
IDDM, notably GAD, ICA-512 and ICA-12 (EP-A-0 569, 800) . 

Whilst searching for protein tyrosine phosphatases, 
the present inventors unexpectedly cloned and sequenced a 

15 cDNA fragment, and then a complete cDNA, encoding part of a 
new probable IDDM-associated autoantigen. This has been 
give the name IAR (Islet Cell Antigen-Related) and may also 
be referred to as IAR-PTP (Islet Cell Related Protein 
Tyrosine Phosphatase) . 

20 Analysis of this partial sequence shows that, like the 

known autoantigens mentioned above, it has an extracellular 
region and an intracellular region. The partial IAR 
sequence encodes around 4 60 amino acids from the 
extracellular domain and the whole of the intracellular 

25 domain. It is likely that the intracellular domain is an 
antigenic part of the protein. 

The IAR sequence identified by the inventors shows 
around 70% nucleotide and amino acid sequence homology to 
the IA-2 sequence in the intracellular region. This 

3 0 provides further evidence that IAR is an IDDM-associated 
autoantigen. In the extracellular region, the nucleotide 
homology to IA-2 is about 35% and the amino acid sequence 
homology is about 20% . 

Although previous work has suggested that hIA-2, PTLTP 

3 5 and mIA-2 do not have protein tyrosine phosphatase (PTP) 
activity, the present inventors have found that hIA-2 in 
fact has low but detectable PTP activity. Like hIA-2, IAR 
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has PTP activity, and is also referred to herein as IAR-PTP 
(Islet Cell Antigen-Related Protein Tyrosine Phosphatase) . 

Further, IAR has a similar tissue expression pattern 
to IA-2, being highly expressed in the brain and pancreas. 
5 Thus, IAR has three characteristics of an IDDM-associated 
autoantigen: (i) a characteristic structure, in terms of 
organisation into intra- and extra- cellular domains and 
the sequence of its intracellular domain; (ii) PTP 
activity; and (iii) a characteristic tissue distribution. 

10 IAR is also reactive with IDDM patient sera. A study 

with a group of IDDM patients shows that recombinant IAR is 
precipitated by autoantibodies present in more than 50% of 
the serum samples while IAR does not react with any sera 
from control patients. 

15 Nonetheless, it is clear that IAR is distinct from 

previously isolated autoantigens ; in particular, its 
sequence identity to hIA-2, mIA-2 and PTPLP is too low for 
it to be any of these proteins, especially as it has only 
20% homology to hIA-2 at the amino acid level in the 

20 extracellular region. Further, IAR and IA-2 can 

distinguish different populations of IDDM autoantibodies. 

The inventors have also carried out a study to. compare 
the IDDM sensitivity and predictive, value of IAR antibodies 
(IAR Ab) and IA-2 antibodies (IA-2 Ab) . Antibodies to IAR 

25 and IA-2, measured by radiobinding assay, were correlated 

in recent -onset IDDM patients. Some individuals had IAR Ab 
in the absence of IA-2 Ab, and others had IA-2 Ab in the 
absence of IAR Ab. Therefore, IAR and IA-2 can distinguish 
different populations of IDDM autoantibodies since they 

30 identify overlapping but non-identical sets of IDDM 

patients. The diagnostic sensitivity for IDDM was similar 
for IAR Ab and IA-2 Ab. Accordingly, IAR is clearly a 
useful predictive tool for the diagnosis of IDDM, either as 
an alternative IA-2 or in combination with IA-2. 

35 Moreover, in a group of first degree relatives of IDDM 

sufferers, the presence of IAR Ab were significantly 
. associated with progression to IDDM while IA-2 Ab were not. 



BNSDOCID: <WO 9722694 A2> 



WO 97/22694 




PCT/CA96/00867 



- 4 - 

This is significant because it establishes that IAR Ab are 
likely to be better predictors of IDDM than IA-2 Ab in 
first degree relatives of IDDM sufferers, who are a high 
risk group for contracting IDDM. Therefore, the IAR of the 
5 invention is likely to be effective in predicting the onset 
of IDDM in high-risk groups such as first-degree relatives. 

Further, the inventors have found that the cytoplasmic 
(intracellular) domain of IAR is itself a useful predictor 
of IDDM. It is likely to be as useful as, or more useful 

10 than, complete IAR. 

A brain Expressed Sequence Tag (EST) is available 
which has a sequence corresponding to part of the IAR 
sequence and highly homologous to the human IA-2 sequence. 
This is EST03250 (Adams et aJ; Nature Genetics Vol. 4, July 

15 1993) . The length of the EST is 313 nucleotides. It was 
identified during the partial sequencing of over 3400 ESTs 
from human brain cDNA, in an attempt to demonstrate the 
diversity of RNA transcripts present in brain cells. IAR 
of course differs in terms of sequence from this EST 

20 because it has a different, longer, sequence. Also, it is 
unknown whether or not the EST encodes a functional 
autoantigen. 

IAR sequences of the invention can be used in the 
diagnosis of IDDM, for example to screen for or predict the 

25 onset, presence or development of IDDM. For example, 

individuals who have, or are predisposed to develop, IDDM 
can be expected to have autoantibodies to the auroantigenic 
IAR in their blood. Thus, blood, serum or other samples 
can be taken and contacted with the IAR of the invention. 

30 The anti-IAR autoantibodies will recognise the exogenous 
IAR and this recognition can be detected, for example by 
detecting the IAR/anti-IAR complex thus formed. 

Accordingly, the present invention provides: 

A nucleic acid encoding a polypeptide that has the 

35 properties of an Insulin -dependent diabetes mellitus 
{ IDDM) -associated autoantigen, which nucleic acid 
comprises: 
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(a) the coding sequence of SEQ. ID. No. 1, 10 or 12 and/or 
the sequence complementary thereto ; 

(b) a sequence which hybridises to a sequence defined in 
(a) ; 

5 (c) a sequence that is degenerate as a result of the 

genetic code to a nucleic acid sequence defined in (a) 
or (b) ; 

(d) a sequence having at least 80% homology to a sequence 
defined in (a) , (b) or (c) ; 
10 a polypeptide encoded by such a nucleic acid; 

a nucleic acid vector including such a nucleic acid; ; 
a cell harbouring such a vector; 

a method of diagnosing IDDM which comprises: 

(a) contacting a sample from an individual with a 

15 polypeptide as defined above under conditions that 

permit the recognition of the polypeptide by 
autoantibodies that recognise IAR; and 

(b) determining whether or not said recognition occurs. 

a method of producing a polypeptide as defined 
20 above which comprises culturing a cell as defined above 
under conditions which permit the expression of the 
polypeptide; and recovering the polypeptide ; 

use of a polypeptide as defined above in diagnosing 

IDDM; 

25 a diagnostic test kit for diagnosing IDDM which 

comprises a polypeptide as defined above, and means for 
determining whether or not the polypeptide is recognised by 
autoantibodies that recognise IAR. 



30 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Northern analysis of r75 tissue expression. 
Human multiple tissue Northern blots are from Clontech and 
the source of the mRNA in each lane is indicated at the 
35 bottom of the figure. The r75 probe was labelled with 32 P- 
dCTP (Amersham) using the High Prime DNA Labeling Kit 
(Boehringer Mannheim) . Hybridizations were carried out at 
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65°C overnight in 0 . 5M sodium phosphate (pH 6.8), 7% SDS, 
15% formamide, ImM EDTA . After washing 3 times with 50mM 
sodium phosphate (pH 6.8), 1% SDS for 40 min at 65°C, the 
blots were exposed to film at -70°C. Blots were stripped 
5 of probe by incubation in freshly boiled 0.1% SDS (3x2 
min) before reprobing. Top panels, r75; bottom panels, 0- 
actin. The positions of molecular size markers (in kb) are 
shown to the left of each blot. 

10 Figure 2. Schematic depiction of IAR clones and predicted 
protein prod :t. 

A. Overlapping cDNA clones C3 and 3-7. Nucleotide numbering 
is as in SEQ. ID. No.l. 

B. Predicted IAR protein product aligned with the IA-2 
15 protein. The predicted partial IAR polypeptide (top) 

translated from and aligned with the clones shown in A is 
depicted in alignment with the IA-2 protein (Lan et al . , 
1994) . The % amino acid similarity between the 
intracellular regions of IAR and IA-2 is shown, as is that; 
20 between the partial extracellular region of IAR and the 
corresponding portion of IA-2. TM, transmembrane region. 

C. Relationships of cDNA clones. The relatwe positions of 
clones Bll, C3 , 3-7 and 3-10 are shown. The coding portion 
of these clones is represented by the thick line on the top 

25 scale. Both strands of clones Bll, C3 , and 3-7 were 

sequenced in their entirety. The angled 5' piece of C3 is 
not identical to the corresponding region of Bll and likely 
represents a cDNA library artifact, as does the boxed 
region (54 bp) of C3 which is a sequence found in the 

30 opposite orientation in the corresponding region of Bll. 

Figure 3. Northern analysis of IA-2 tissue expression. 

Human multiple tissue Northern blots (different membranes 
from those used for Fig.l) are from Clontech and the source 
35 of the mRNA in each lane is indicated at the bottom of the 
figure. The IA-2 probe (nucleotides 1590-1906, coding for 
a portion of the extracellular region, the transmembrane 
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region, and a small portion of the intracellular region; 
and non-homologous to IAR sequence) was labelled and 
hybridization carried out as described in the legend to 
Fig. 1. The position of the IA-2 transcript is indicated 
5 by the arrow at the right of the figure. The £-actin 

signal (position indicated by the parentheses to the right 
of each panel) results from a previous hybrization with a 
/?-actin probe, and could not be completely stripped from 
the membranes before reprobing with IA-2. The positions of 
10 molecular size markers (in kb) are shown to the left of 
each blot. 

Figure 4. Northern analysis of IAR tissue expression. 

Human multiple tissue Northern blots are from Clontech and 
15 the source of the mRNA in each lane is indicated at the 
bottom of the figure. Membranes were stripped of IA-2 
probe (Fig. 2) by incubation in freshly boiled 0.1% SDS (3 x 
2 min) . The IAR probe (nucleotides 790-1197 of SEQ. ID. 
No.l, coding for a portion of the predicted extracellular 

2 0 region and non- homologous to IA-2 sequence) was labelled 

and hybridization carried out as described in the legend to 
Fig. 1. The positions of IAR transcripts are indicated by 
the arrows at the right of the figure. The 0-actin signal 
(position indicated by the parentheses to the right of each 
25 panel) results from a previous hybrization with a 0-actin 
probe, and could not be completely stripped from the 
membranes before reprobing with IAR. The positions of 
molecular size markers (in kb) are shown to the left of 
each blot . 

30 

Figure 5. Comparison of IAR and IA-2 transcripts in human 
brain and pancreas. The indicated lanes from the Northern 
analyses of IA-2 (Fig. 3) and IAR expression (Fig. 4) are 
shown together here for the purpose of direct comparison. 

3 5 The same membrane was hybridized with the IA-2 probe, then 

stripped and rehybridized with the IAR probe as described 
in the legends to Figs . 3 and 4 . 
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Figure 6* Purification and phosphatase activity of the 
intracellular regions of IAR and IA-2, 

A* Purified IA-2 (open arrow) and IAR polypeptides (closed 
arrows) obtained after expression, affinity purification 
5 and protease cleavage of GST- fusion proteins. Lane 1, IA-2 
obtained from expression of pGEX-3C-IA-2 and cleavage with 
3C protease; lane 2, IA-2 obtained from expression of pGEX- 
KG-IA-2 and cleavage with thrombin; lane 3, IAR obtained 
from expression of pGEX-3C-IAR and cleavage with 3C 
10 protease; lane 4, IAR obtained from expression of pGEX-KG - 
IAR and cleavage with thrombin. The positions of molecular 
size markers (kD) are shown to the right of the panel. 

B. Phosphatase activity of IAR. The IAR was produced as in 
lane 3 of panel A. IAR was added to a reaction mixture 

15 containing 50mM sodium acetate (pH 4.5), 0.5 mg/ml BSA, 
0.5mM dithiothreitol and 5mM para-nitrophenyl phosphate 
(PNPP) and incubated at 30°C. At the times indicated, 
aliquots of the reaction (each containing 16/zg of purified 
IAR as measured by Bradford analysis) were removed and 

20 mixed with 13% KHP0 4 to terminate the reaction. The 

absorbance of the stopped reaction was monitored at 4 05 nrn 
and is a measure of the amount of para- nit rophenol product 
formed by dephosphorylation of PNPP. All reactions were 
performed in duplicate and the absorbances averaged. Non - 

2 5 enzymatic hydrolysis of PNPP was accounted for by 

performing appropriate control reactions without added IAR, 
and this absorbance was subtracted from that attained in 
the presence of enzyme. 

C. Phosphatase activity of IA-2. The IA-2 was produced as 
30 in lane 1 of panel A, and phosphatase assays carried out as 

described above for IAR except that reactions contained 2mM 
PNPP. Each reaction aliquot removed at the indicated times 
contained 20;*g of purified IA-2 as measured by B. -dford 
analysis . 

35 

Figure 7. Catalytic activity of IAR. 

A. SDS-PAGE of IAR polypeptide (amino acids 646-1015) 
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obtained after expression, affinity purification, and 
proteolytic cleavage of GST- IAR fusion protein. Lane 1, 
molecular size markers (sizes in kDa shown at left) ; lane 
2, IAR after cleavage with 3C protease. The two uppermost 
5 bands in lane 2 are bacterial proteins which are often 

present after purification of various GST-fusion proteins, 
including IAR(C945S). 

B* pH profile of IAR activity towards PNPP. All reactions 
were carried out for 30 min and contained 22.3/ig/ml IAR and 

10 2mM PNPP. The buffers used were 50mM sodium acetate (pH 4.0 
- 5.0), Mes (pH 5.5 - 6.5) or Tris (pH 7.0 - 8.5) . 
C. Time -dependent dephosphorylation of PNPP by IAR. 
Reactions containing 44.4Mg/ml IAR (■) or IAR(C945S) (o) 
and 5mM pNPP were carried out as described in Materials and 

15 Methods and stopped at the times indicated. IAR activity 
was also measured in the presence of ImM Na3VO< U) . 
Control reactions lacking IAR were simultaneously performed 
and the resulting absorbance at 4 0 5nm subtracted from that 
obtained from reactions carried out in the presence of IAR. 

20 The data shown are the average of duplicate experiments. 

Prior to use in the assays described above in B and C, the 
purified IAR was quantitated by densitometric scanning of 
Coomassie Blue-stained IAR protein band on SDS-PAGE 
(indicated by arrow in A) alongside known amounts of 

25 protein standards. 

Figure 8- Reactivity of IAR with IDDM sera. 

A. Immunoprecipitation of IAR with sera from control and 
recent-onset IDDM subjects. In vitro translated 35 S- 

30 methionine labelled IAR was incubated with sera (prebound 
to Protein A-Sepharose) from control (subjects 5 and 6 in 
Table 1) (lanes 1 and 2) and IDDM subjects (subjects 12, 3, 
and 18 in Table 1) (lanes 3-5) . Immunoprecipitates were 
analyzed by SDS-PAGE and autoradiography. The 41 and 3 2 kDa 

3 5 bands (arrowed) are IAR and an IAR proteolytic product, 
respectively . 

B. Graphical depiction of control and IDDM subject sera 
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reactivity with IAR. The exact values of the data presented 
here and further subject information are given in Table l. 
Experiments were carried out as in A and the 
immunoprecipitated IAR was resolved by SDS-PAGE and 
5 quantitated (in arbitrary density units) using a 
phosphor imager . 

Figure 9 • Relationship IAR Ab and IA-2 Ab in recent 
-onset IDDM patients 

10 Dotted lines represent thresholds of positivity. 
r=0 . 70, p<0 . 0001 . 

Figure 10* Effect of IAR Ab (A) and IA-2 Ab (B) on IDDM- 
free survival in first-degree relatives with ICA > 20JDFU 
15 or IAA > lOOnU/ml 

Dotted line: antibody-negative subjects, solid line: 
antibody positive subjects. A : p<0 . 0001 , B : Not Significant. 

The invention provides nucleic acids encoding the 

20 IDDM-associated autoantigen known as IAR or IAR-PTP and 

other related polypeptides having the properties of IDDM- 
associated antigens. Nucleic acids of the invention 
include the partial IAR-PTP cDNA having the sequence shown 
in SEQ ID No. 1, the complete cDNA sequence (SEQ ID. No 10) 

25 and the cDNA sequence of the cytoplasmic region of the IAR 
(SEQ ID. No. 12) . 

The nucleic acids of SEQ ID Nos . 1 and 10 are 
preferred nucleic acids of the invention. In another 
preferred embodiment, the nucleic acid of the invention 

30 includes a sequence encoding the cytoplasmic domain of IA:^, 
also known as the intracellular domain (SEQ ID No. 12) . 
The DNA sequence encoding the cytoplasmic domain is given 
in SEQ ID No. 12, along with the encoded amino acid 
sequence of the cytoplasmic domain (SEQ ID NO. 13). The 

3 5 nucleotide sequence of SEQ ID NO. 12 corresponds to the 

nucleotides which encode amino acids 646 to 1015 of SEQ ID 
NO. 11. In this embodiment, the sequence of the invention 
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may consist of the sequence encoding the cytoplasmic domain 
or it may include other IAR sequences or non-IAR sequences 
at one or both ends. 

However, it will be appreciated that other nucleic 
5 acids are included within the scope of the invention. 

Nucleic acids of the invention may have substantially 
the sequence of SEQ ID No. 1, 10 or 12. 

Nucleic acids of the invention may consist essentially 
of the sequence of SEQ ID No. 1, 10 or 12. 
10 The nucleic acid sequences of the present invention 

are preferably DNA, for example, cDNA or genomic DNA, 
though they may be RNA. It will be appreciated by those of 
skill in the art that, in RNA sequences according to the 
invention, the T residues shown in SEQ ID No. 1, 10 or 12 
15 will be replaced by U. Further, the invention provides 
both single-stranded and double -stranded nucleic acids. 

The nucleic acids of the present invention are not 
limited to the nucleic acids of SEQ. ID. No. 1, 10 and 12. 
The nucleic acids of the invention include nucleic acids 
20 having sequences that are closely related to these 

sequences; these nucleic acids encode polypeptides of the 
invention . 

Such nucleic acids of the invention may, for example, 
be prepared by altering that of SEQ ID No. 1, 10 or 12 by 

25 any conventional method, or isolated from any organism or 
made synthetically. Such alterations, isolations or 
syntheses may be performed by any suitable method, for 
example by the methods of Sambrook et al : {Molecular 
Cloning: A Laboratory Manual; 1989) . 

30 For example, nucleic acids of the invention may 

include substitutions, deletions, insertions, or extensions 
that distinguish them from the nucleic acid of SEQ. ID. No. 
1, 10 or 12 as long as these do not destroy the ability of 
the nucleic acid to encode a polypeptide that functions as 

35 an IDDM-associated autoantigen. 

A substitution, deletion or insertion may suitably 
involve one or more nucleotide positions. Thus, nucleic 
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acid sequences of the invention may differ from those of 
SEQ ID No. 1, 10 or 12 at any number of nucleotide 
positions, for example as a result of deletion, 
substitution, insertion or extension, as long as they 
5 encode a polypeptide of the invention. Sequences of the 
invention may, for example, differ from that of SEQ ID No. 
1, 10 or 12 at 1, 2, 3, 4, 5, 6 to 10, 10 to 20, 20 to 30, 
30 to 40, 40 to 50 or 50 to 100 nucleotide positions. 

For example, the sequences of the invention include 

10 sequences that are capable of selective hybridisation to 
either strand of the sequence of SEQ. ID. No. 1, 10 or 12 
and that encode a polypeptide of the invention. Such 
sequences capable of selectively hybridizing to the DNA of 
SEQ. ID. No. 1, 10 or 12 will generally be up to 80%, 

15 preferably at least 80%, more preferably at least 90%, at 
least 95% or 99% homologous to the DNA of SEQ. ID. No. 1, 
10 or 12. Such homology will preferably apply over a 
region of at least 20, preferably at least 50, fcr instance 
100, 500 or 1000 or more contiguous nucleotides. 

20 Such hybridisation may be carried out under any 

suitable conditions known in the art (see Sambrock et al 
(1989) : Molecular Cloning: A Laboratory Manual) . For 
example, if high stringency is required, suitable 
conditions include 0.2 x SSC at 60°C. If lower stringency 

25 is required, suitable conditions include 2 x SSC at 60°C. 

Also included within the scope of the invention are 
sequences that differ from those defined above and because 
of the degeneracy of the genetic code, encode the same 
polypeptide of the invention. For example, the invention 

30 provides degenerate variants of the sequence of SEQ ID No. 

1, 10 or 12 that also encode the polypeptide of SEQ ID No. 

2, 11 or 13. 

The nucleic sequences of the invention may be of any 
length as long as they encode a polypeptide of the 
35 invention. For instance, a nucleic acid sequence according 
to the invention will typically comprise one or more 
sequences including one or more of the antigenic epitopes 
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of IAR-PTP. A nucleic acid sequence according to the 
invention may be a contiguous fragment of the sequence of 
SEQ. ID. No. 1, 10 or 12 or a sequence that is related to 
it in any of the ways described above. Alternatively, 
5 nucleic acids of the invention may comprise DNA sequences 
that are not contiguous in the sequence of SEQ. ID. No. 1, 
10 or 12 . These sequences may be fragments of the sequence 
of SEQ. ID. No. 1/ 10 or 12 or nucleic acid sequences that 
are related to such fragments in any of the ways described 

10 above. Nucleic acid sequences of the invention will 

preferably comprise at least 50 bases or base pairs, for 
example 50 to 100, 100 to 500, 500 to 1000, or 1000 to 
2000 bases or base pairs. 

In particular, it will be appreciated that SEQ. ID. 

15 No. 1 is a partial cDNA sequence of IAR-PTP. The invention 
also includes sequences extended at either the 3' end or 
the 5' end, or both ends, by further nucleic acid sequence. 
5' extensions are preferred. This sequence may be of any 
nature and of any suitable length. For example, an 

20 extension may comprise up to 10, up to 20, up to 50/, up to 
100, up to 200, up to 500, up to 1000, up to 1500,. .up to 
2000, up to 2500 or up to 3000 or more bases or base pairs. 
For example, an extension may comprise the sequence that is 
contiguous with the sequence of SEQ. ID. No. 1 in the 

25 complete IAR-PTP cDNA (SEQ ID. No. 10) . Alternatively, it 
may be any other sequence. Thus, preferred nucleic acids 
of the invention include complete IAR-PTP cDNAs and 
fragments thereof which include SEQ. ID. No. 1, 
particularly human cDNAs and fragments. Also, the 

30 invention provides nucleic acids encoding fusion proteins 
. comprising polypeptides of the invention. 

A particularly preferred nucleic acid of the invention 
has the sequence of SEQ ID No. 10, the complete IAR cDNA 
sequence. Another particularly preferred nucleic acid of 

3 5 the invention has the sequence of the cytoplasmic domain of 
IAR, SEQ ID NO. 12. 

Similarly, the extension or extensions could have any 
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other sequence. Also, the sequence of SEQ. ID. No. 1, 10 
or 12 may be interrupted in one or more places by non- 
coding sequence. In particular, the non-coding sequence 
may be introns that occur in the IAR-PTP genomic DNA. 
5 Further preferred nucleic acids of the invention are thus 
IAR-PTP genomic DNAs, and fragments thereof, that include 
the sequence of SEQ ID No. 1, 10 or 12, particularly human 
genomic DNAs and fragments thereof . 

Three preferred polypeptides of the invention have the 
10 sequences shown in SEQ. ID. No. 2, (partial amine acid 
sequence of IAR) , SEQ, ID. No. 11 (complete amine acid 
sequence of IAR) and 13 (amino acid sequence of cytoplasmic 
domain of IAR) . 

SEQ ID No. 13 corresponds to amino acids 646 to 1015 
15 of SEQ ID No. 11. However, the polypeptides of the 

invention are not limited to the polypeptides of SEQ. ID. 
No. 2, 11 and 13. Rather, the polypeptides of the 
invention also include polypeptides with sequences closely 
related to those of SEQ. ID. No. 2, 11 and 13 that have 
20 suitable properties. Thus, the polypeptides of the 

invention typically have substantially the sequence of SEQ 
ID No. 2, 11 or 13. 

Polypeptides of the invention may have a sequence 
which consists essentially of the sequence of SEQ ID No. 2, 
25 11 or 13 . 

Polypeptides of the invention may have a sequence 
which is substantially the sequence of SEQ. ID. No. 2, 11 
or 13 . 

The polypeptides of the invention are encoded by the 
30 nucleic acids of the invention, as described herein. The 
polypeptides of the invention, IAR-PTP and closely related 
polypeptides, are IDDM-associated autoant igens , or have the 
properties of IDDM-related autoantigens . 

Thus, the polypeptides of the invention are typically 
35 capable of provoking an autoimmune response in individuals 
who have, or are predisposed to have, or are lik€:ly to 
develop IDDM. Thus, the polypeptides of the invention may 
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provoke the production of IDDM-associated autoantibodies in 
such individuals. Also, or alternatively, they may provoke 
other elements of the autoimmune response. 

Further, the polypeptides of the invention are 
5 typically recognised by autoantibodies that already present 
in the sera of such individuals, and are capable of 
recognising the natural IAR-PTP which contributes to the 
autoimmune response in these individuals. 

Optionally, the polypeptides of the invention may 

10 have PTP activity. 

Polypeptides of the invention having sequences related 
to the sequence may be prepared by altering the polypeptide 
of SEQ. ID, No. 2, 11 or 13 by any conventional method, or 
isolated from any organism or made synthetically. Such 

15 alterations, isolations or syntheses may be performed by 
any conventional method, for example by the methods of 
Sambrook et al (Molecular cloning; A Laboratory Manual ; 
1989) ^ i n particular, polypeptides related to those of 
SEQ . ID. No. 2, 11 and 13 may be prepared by modifying DNA 

20 sequences as shown in SEQ. ID. No. 1, 10 or 12 and 
expressing them recombinantly . 

Polypeptides of the invention may include 
substitutions, deletions, insertions, or extensions that 
distinguish them from the polypeptide of SEQ. ID. No. 2, 11 

25 or 13 as long as these do not destroy the ability of the 
polypeptide to function as an IDDM-associated autoantigen. 

A substitution, deletion or insertion may suitably 
involve one or more amino acids, typically from one to 
five, one to ten or one to twenty amino acids. For 

3 0 example, a substitution, deletion or insertion may involve 
one, two, three, four, five, eight, ten, fifteen, or twenty 
amino acids. Polypeptides of the invention are 
substantially homologous to that of SEQ. ID. No. 2, 11 or 
13. Typically, a polypeptide of the invention has up to 

35 80%, preferably at least 80%, at least 90%, at least 95%, 
at least 98% or at least 99% sequence identity to the 
polypeptide of SEQ . ID. No. 2, 11 or 13. 
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In general, the physicochemical nature of the 
polypeptide of SEQ. ID. No. 2, 11 or 13 should be preserved 
in a sequence of the invention. Such sequences will 
generally be similar in charge, hydrophobicity and size to 
5 that of SEQ . ID. No. 2, 11 or 13. Thus, substitution will 
typically be conservative. Examples of conservative 
substitutions that do not greatly affect the 

physicochemical nature of amino acid sequences are those in 
which an amino acid from one of the following groups is 
10 substituted by a different amino acid from the same group: 

H, R and K 

I, L, V and M 
A, G, S and T 

i 

D, E, P and N. 

15 As far as extensions are concerned, a sequence of one 

or more amino acids may be provided at either or both of 
the C- and N- termini of the sequence of SEQ. ID. No. 2, 11 
or 13 or a sequence related to it in any of the ways 
defined herein. An extension may comprise up to 5, up to 

20 10, up to 20, up to 50, up to 100, up to 200, up to 300, up 
to 4 00, up to 500 or more amino acids. For example, an 
extension may comprise one, two, three, four, five, up to 
ten, up to 20, up to 30, up to 50, up to 100, up to 200 or 
more amino acids. 

25 In particular, extensions may include amino acid 

sequences which are found in the native complete IAR-PTP ( 
protein. Thus, preferred polypeptides of the invention 
include complete IAR-PTP proteins and polypeptide fragments 
thereof that include the amino acid sequence of SEQ. ID. 

30 No. 2, particularly human proteins and fragments thereof. 
Also, the invention provides fusion proteins including 
polypeptides of the invention. 

A parr : cularly preferred polypeptide of the invention 
has the sequence of SEQ. ID. No. 11, the sequence encoded 

35 by the complete IAR cDNA (SEQ. ID. No. 10). A further 

particularly preferred polypeptide has the sequence of SEQ 
ID NO. 13, which is the sequence of the cytoplasmic domain 
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of IAR. 

A polypeptide of the invention may be subjected co one 
or more chemical modifications, such as glycosylat ion, 
sulphation, COOH-amidation or acylation. 
5 A polypeptide of the invention may comprise multiple 

copies of the sequence of SEQ. ID. No. 2, 11 or 13 or a 
sequence related to it in any of the ways defined herein. 

A polypeptide of the invention may be of any length as 
long as it has the ability to function as an IDDM- 

10 associated autoantigen. For instance, polypeptides of the 
invention preferably comprise one or more epitopes of IAR- 
PTP and therefore retain the ability to provoke an 
autoimmune response and/or be recognised by preexisting 
IDDM autoantibodies. Such polypeptides of the invention 

15 could be considerably smaller than the polypeptide of SEQ. 
ID. No. 2. 

Polypeptides according to the invention may include 
one or more fragments of the sequence of SEQ. ID. No. 2, 11 
or 13 or fragments that are related to SEQ. ID. No. 2 in 
20 any of the ways described above. Such fragments typically 
comprise at least 10, for example 10 to 20 or 20 to 50, 
amino acids. 

Alternatively, polypeptides of the invention may 
comprise amino acid sequences that are not contiguous in 
25 SEQ. ID. No. 2, 11 or 13. These amino acid sequences may 
be identical to parts of the native amino acid sequence or 
related to such parts in any of the ways described herein. 
For example, polypeptides of the invention may comprise two 
or more epitopes of IAR-PTP, optionally separated by IAR 
3 0 sequence or non-IAR sequence. 

Polypeptides according to the invention preferably 
comprise at least 10 amino acids, for example 10 to 20, 20 
to 50, 50 to 100, 100 to 200, or 200 to 500 or 500 to 1000 
amino acids. 

35 Polypeptides and nucleic acids according to the 

invention may be substantially isolated, isolated, purified 
or substantially purified. For example, they may be 
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present in preparations, for example solutions, which have 
undergone one or more purification steps. Optionally, they 
may be completely pure. They may be present in 
preparations, for example solutions, which, apart from the 
5 nucleic acid and/or polypeptide of the invention consist 
essentially of solvents and/or biologically inert carriers. 
Polypeptides of the invention in substantially purified 
form will generally comprise the polypeptide in a 
preparation in which more than 90%, eg. up to 95V, up to 
10 98% or up to 99% of the peptide material in the preparation 
is that of a polypeptide or polypeptides according to the 
invention. 

The nucleic acids and polypeptides of the invention 
were originally derived from the human genome. However, 

15 nucleic acid sequences and/or polypeptides of the invention 
may also be obtained from other eukaryotic genomes, 
especially other mammalian genomes. They may be obtained 
either by conventional cloning techniques or by probing 
genomic or cDNA libraries with nucleic acid sequences 

20 according to t.he invention. This can be done by any 

conventional method, such as the methods of Sambrook et al 
(Molecular Cloning: A Laboratory Manual; 1989) . In 
particular, the complete cDNA and/or genomic DNA sequences 
of human IAR-PTP can be determined in this way, and the 

25 complete amino acid sequence of IAR-PTP can be determined. 

A nucleic acid sequence according to the invention may 
be included within a vector, suitably a replicable vector 
or an expression vector, for instance a replicable 
expression vector . 

30 Such an expression vector typically comprises an 

origin of replication so that the vector can be replicated 
in a host cell such as a bacterial host cell or a yeast 
host cell. A suitable vector will also typically comprise 
the following elements, usually in a 5' to 3 ' arrangement: 

3 5 a promoter for the directing expression of the nucleic acid 
sequence of the invention and optionally a regulator of the 
promoter, a translat ional start codon, a nucleic acid 
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sequence according to the invention. 

The vector may also contain one or more selectable 
marker genes, for example an ampicillin resistance gene for 
the identification of bacterial trans formants or a marker 
5 gene that allows selection of yeast or other eukaryotic 

transf ormants . Optionally, the vector may also comprise an 
enhancer for the promoter. The vector may also comprise a 
polyadenylation signal operably linked 3' to the nucleic 
acid of the invention. The vector may also comprise a 

10 transcriptional terminator 3' to the sequence encoding the 
polypeptide of the invention. 

The vector may also comprise one or more introns or 
other coding sequences 3' to the nucleic acid sequence of 
the invention. The intron or introns may be from the human 

15 genome (the organism from which the sequences of the 

invention were originally derived) or the host organism 
which is to be transformed with the vector or from another 
eukaryotic organism . 

In an expression vector, the nucleic acid sequence of 

20 the invention is operably linked to a promoter capable of 
expressing the sequence. "Operably linked" refers to a 
juxtaposition wherein the promoter and the nucleic acid 
sequence encoding the polypeptide of the invention are in a 
positional relationship permitting the coding sequence to 

25 be expressed under the control of the promoter. Thus, 

there may be elements such as non-coding sequence 3' or 5 ' 
to the coding sequence. These elements may be native 
either to the human genome or, for example, to the organism 
from which the promoter sequence is derived. 

30 Alternatively, the said element or elements may be 

native to neither the organism from which the promoter 
sequence is derived nor to the human genome. Such 
sequences can be included in the vector if they enhance or 
do not impair the correct control of the coding sequence by 

35 the promoter. 

An expression vector may be of any type. The vector 
may be in linear or circular form. For example, it may be 
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a plasmid or viral vector. Those of skill in the art will 
be able to prepare suitable vectors comprising nucleic acid 
sequences encoding polypeptides of the invention starting 
with widely available vectors which will be modified by 
5 genetic engineering techniques such as those described by 
Sambrook et a] (Molecular Cloning: A Laboratory Manual ; 
1989) . 

In an expression vector, any promoter capable of 
directing expression of a sequence of the invention may be 

10 operably linked to the nucleic acid sequence of rhe 

invention. For example, suitable promoters include yeast 
promoters, mammalian promoters, viral promoters and 
bacterial promoters. Such promoters may be constitutive cr 
regulable. Some suitable promoters include the 

15 cytomegalovirus (CMV) early promoter, the SV4 0 promoter, 
the mouse mammary tumour virus promoter, the human 
elongation factor 1 ar-P promoter (EF-la-P) , the SRor 
promoter and the mouse metallothienein gene I (mMTl) 
promoter. 

20 Typically, nucleic acid sequences according to the 

invention will be inserted into such vectors in a sense 
orientation. However, nucleic acid sequences according to 
the invention may also be inserted into the vectors 
described above in an antisense orientation in order to 

25 provide for the production antisense RNA . Antisense RNA 

may also be produced by synthetic means. Such antisense < 
RNA may be used in a method of controlling the levels of 
the polypeptide of SEQ. ID. No. 2 or 11 or 13 or a protein 
encoded by a related nucleic acid sequence in a cell. Such 

30 a protein is, for example, a complete IAR-PTP protein. 
Thus, antisense sequences of the invention can form the 
basis of methods of gene therapy aimed at suppressing the 
autoantigenicity of IAR-PTP. 

Vectors according to the invention may be used in 

35 vitro, for example for the production of RNA hybridisable 
to cDNA, of the invention. Such vectors may be used to 
transfect or transform a host cell. Depending on the type 
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of vector, they may be used as cloning vectors to amplify 
DNA sequences according to the invention or to express this 
DNA in a host cell. 

A further embodiment of the invention provides host 
5 cells harbouring one or more vectors and/or nucleic acids 
of the invention. Typically, such cells are transformed or 
transfected with the vectors for the replication and/or 
expression of nucleic acid sequences according to the 
invention, including the DNA of SEQ. ID. No. 1, 10 and 12. 

10 The cells will be chosen to be compatible with the vector 
and may for example be bacterial cells, mammalian, insect 
or yeast cells. 

The cells may be transformed or transfected by any 
suitable method, such as the methods disclosed by Sambrook 

15 et al (Molecular Cloning: A Laboratory Manual; 1989). For 
example, vectors comprising nucleic acid sequences 
according to the invention may be packaged into infectious 
viral particles, such as retroviral particles. The 
constructs may also be introduced by elect roporat ion, 

20 calcium phosphate precipitation, biolistic methods or by 
contacting naked nucleic acid vectors with the cells. in 
solution. 

In the said nucleic acid vectors with which the host 
cells are transformed or transfected, the nucleic acid may 
25 be DNA or RNA, preferably DNA, it may be single-stranded or 
double - stranded . 

The vectors with which the host cells are transformed 
or transfected may be of any suitable type. For example, 
the vectors may be able to effect integration of nucleic 
30 acid sequences of the invention into the host cell genome 
or they may remain free in the host cell. Typically, the 
vectors will be expression vectors, such as a retroviral 
vector or a DNA expression vector as defined herein. For 
example, the vector used for transformation construct may 
35 be a plasmid vector. 

The transformed or transfected cells of the invention 
can be used in a process of production of polypeptides of 
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the invention. Such processes will typically comprise 
transforming or transfecting host cells with vectors 
comprising nucleic acid sequences according to the 
invention and expressing the nucleic acid sequence in these 
5 cells. 

In this case, the nucleic acid sequence will be 
operably linked to a promoter capable of directing its 
expression in the host cell. Desirably, such a promoter 
will be a "strong" promoter capable of achieving high 

10 levels of expression in the host cell. It may be desirable 
to overexpress the polypeptide according to the invention 
in the host cell. 

Suitable host cells for this purpose include 
bacterial cells, for example E.coli cells; yeast cells ; 

15 mammalian cells, for example CHO cells; and insect cells, 
preferably in combination with a baculovirus expression 
system. Eukaryotic cells, such as mammalian, insect and 
yeast cells are preferred in situations where glycosylated 
polypeptide products are desired. 

20 The thus produced polypeptide of the invention may be 

recovered by any suitable method known in the art. 
Optionally, the thus recovered polypeptide may be purified 
or partly purified by any suitable method, for example a 
method according to Sambrook et al (Molecular Cloning: A 

25 Laboratory Manual) . 

The nucleic acid sequences of the invention may be 
used to prepare probes and primers. These will be useful 
in the isolation of nucleic acid sequences having sequences 
similar to that of SEQ. ID. No. 1, 10 or 12 , which may 

3 0 encode polypeptides of the invention. Such probes and 

primers may be of any suitable length, desirably from 10 to 
100, for example from 10 to 20, 20 to 50, 50 to 100 bases 
in length. Typically, the probes and primers will 
correspond exactly to part of the sequence of SEQ. ID. No. 

35 1, 10 or 12 over their entire length, although they may 

vary from this sequence in any of the ways described above. 
Such probes and/or primers may be used, in particular to 
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isolate complete IAR-PTP cDNA and/or genomic DNA sequences 
from the human genome; or from the genomes of other 
organisms . 

The present invention also provides antibodies, 
5 preferably monoclonal antibodies, to the polypeptides of 

che invention. Such antibodies may be useful in methods of 
diagnosing IDDM, as described herein. 

Antibodies of the invention include fragments of 
whole antibodies which retain their binding activity for a 
10 target antigen. Such fragments include Fv,.Fab' and Fab' 2 
fragments, as well as single chain antibodies. 

The antibodies may be produced by any suitable method 
known in the art, such as the methods of Sambrook et al 
(Molecular Cloning: A Laboratory Manual; 1989) . For 
15 example, they may be prepared by conventional hybridoma 
techniques or, in the case of modified antibodies or 
fragments, by recombinant DNA technology, for example by 
the expression in a suitable host vector of a DNA construct 
encoding the modified antibody or fragment operably linked 
20 to a promoter, as described above. Suitable host cells 

include bacterial (for example E.coli) , yeast, insect and 
mammalian cells. 

Therefore, the invention also provides nucleic acid 
sequences encoding antibodies of the invention, vectors 
25 comprising such sequences, cells harbouring such vectors or 
sequences, and methods for expressing antibodies of the 
invention by recombinant techniques. 

Polyclonal antibodies may also be raised, using 
conventional techniques in the art, using the polypeptides 
3 0 of the invention. For example, polyclonal antibodies may be 
raised by inoculating a host animal, such as a rat, mouse 
or rabbit, with a polypeptide of the invention, and 
recovering the immune serum. 

Antibodies according to the invention may be useful in 
35 methods of the invention, as described herein, for the 
diagnosis of IDDM . 

The invention further provides methods of diagnosing 
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IDDM. Thus, for example, polypeptides of the invention can 
be used in methods of diagnosing any aspect of IDDM, For 
example, they may be used in diagnosing the onset; of IDDM, 
the presence of IDDM, or predisposition to IDDM, or in 
5 monitoring the development or progress of IDDM, and in 
methods of predicting the possible future occurrence of 
IDDM. For example, individuals who have, or are 
predisposed to develop, IDDM can be expected to have 
autoantibodies to the autoantigenic IAR-PTP. 

10 In particular. The IAR polypeptides of the invention 

are useful in methods carried out on individuals having a 
high likelihood of developing IDDM, for example, relations, 
especially close relations, more especially first degree 
relations (parents and siblings), of known IDDM sufferers. 

15 For example, the IAR polypeptides of the invention are 

particularly useful in determining the likelihood that a 
close relation of an IDDM sufferer will contract IDDM. 
Close relations include, for example, parents, 
grandparents, siblings, children, grandchildren, cousins, 

20 uncles, aunts, nieces, grand aunts and uncles and g'rand 
nieces and nephews. 

Thus, samples of biological fluid suspected of 
containing such autoantibodies can be taken from 
individuals and contacted with polypeptides of the 

25 invention. The anti-IAR autoantibodies, if present, will 
recognise the polypeptides of the invention and this 
recognition can be detected, for example by detecting the 
IAR/anti-IAR complex thus formed. 

Thus, the methods of the invention typically comprise 

30 a step of determining whether or not recognition of the 
polypeptide of the invention by autoantibodies occurs in 
the future. Similarly, if no such recognition occurs, the 
test suggests that the individual does not have IDDM or is 
unlikely to develop it in the future. 

35 Recognition of polypeptides of the invention by 

antibodies present in such a sample correlates with IDDM or 
a potential for developing IDDM. 
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The methods of the invention are thus typically 
methods of assaying for the presence or absence of 
autoantibodies that recognise IAR. They may also quant itate 
the amount or concentration of such autoantibodies in a 
5 sample . 

Thus, diagnostic methods of the invention typically 
comprise : 

(a) contacting a sample from an individual with a 
polypeptide according to the invention under 

10 conditions that permit the recognition of the 

polypeptide by autoantibodies that recognise IAR; and 

(b) determining whether or not said recognition occurs. 
Any suitable sample from the individual may be used. 

Typically the sample will comprise a biological fluid, 
15 preferably blood, plasma or serum. The sample may be of 
any size. 

The sample is contacted with a polypeptide of the 
invention and the presence or absence of immunoreactivi ty 
between them is determined. 

20 In some embodiments of the invention, a second IDDM- 

related autoantigen may be used. Thus, the invention 
provides methods, as described above, which further 
comprise, in addition to steps (a) and (b) , contacting the 
sample with at least one further IDDM-related autoantigen, 

25 preferably an islet cell antigen (ICA) . This may be done at 
the same time, or at a different time to the contacting 
with the polypeptide of the invention. 

In such methods, it is determined whether or not 
autoantigens recognise the further autoantigen or 

30 autoantigens. The same sample or a different sample from 
the same individual may be contacted with the polypeptide 
of the invention and the further IDDM-associated 
autoantigen. Such a further autoantigen may be, for 
example, hIA-2, GAD, ICA-512, ICA-12, ICA-13, ICA-208, ICA- 

35 302, ICA-313, ICA-525 or ICA-505; or a polypeptide that 
functions as one of these further autoantigens. 

Preferred further autoantigens include: IA-2, 
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preferably hIA-2; ICA-512; and GAD; and antigenic fragments 
thereof. Preferred combinations of autoantigens therefor* 
include: a polypeptide of the invention and one, two or all 
three of IA-2, GAD and ICA-512. Further IDDM-associated 
5 autoantigens may be used, as appropriate, in any of the 
techniques described below. 

Any conventional technique may be used to determine 
whether or not polypeptides of the invention are recognised 
by autoantibodies. This also applies to determining 

10 whether or not any further autoantigens which may be used 
are recognised by autoantibodies. 

For example, suitable immunoassay formats employ a 
combination of solid phase or immobilized reagents and 
labelled reagents whereby the association of the label with 

15 the solid phase is a function of the presence or absence of 
recognition of the polypeptide of the invention cr a 
further autoantigen. In general, such a solid phase 
reagent comprises a binding substance such as a polypeptide 
of the invention, an anti -antibody (e.g., anti-IgG), or 

2 0 other immunobinder or other binding agent according to the 
assay protocol involved, bound or attached, covalently or 
non-covalently , to a solid phase matrix or in an otherwise 
immobilized form. Suitable solid phase matrices are 
conventional in the art and include such matrices as 

25 microtiter plate wells, test tubes and other test 

containers or vessels, test strips, beads, and particles 
such as plastic microparticles and latexes. Where a solid 
phase reagent comprises more than one type of polypeptide 
of the present invention and/or a polypeptide of the 

30 invention and one or more further autoantigens, it will be 
recognised that each reagent can be physically separated or 
isolable from the others or two or more can be mixed or 
otherwise associated in an undifferentiated manner with the 
solid phase. For example, reagents can be immobilised in 

35 separate wells of a microtiter plate or can occupy the sane 
wells; or, where the solid phase is in the form of 
particles, each individual particle can have attached only 
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a single reagent or can have attached two or more reagents. 

Similarly, useful labelled reagents comprise a binding 
substance such as a polypeptide of the invention, a further 
autoantigen, an anti-antibody (e.g., anti-IgG) , or other 
5 immunobinder or other binding agent according to the assay 
protocol involved, which is chemically coupled with a 
detectable chemical moiety. Useful labels are conventional 
in the art and include fluorescers, chemiluminescers , 
radioisotopes, and enzymes. Enzyme labels are particularly 

10 useful and are generally selected from alkaline 

phosphatase, peroxidase, and /?-galactosidase . Enzyme 
labels are readily detectable by addition of a 
corresponding chromogenic substrate and detecting the 
resulting colour or fluorescent response. 

15 One particular immunoassay format that can be applied 

to the present method employs an immobilized form of the 
polypeptides of the invention, and optionally an 
immobilized form of a further autoantigen, as an 
immunoreagent . A test sample is incubated with the solid 

20 phase reagent and preferentially washed to remove unbound 
material. A labelled antibody reagent is then added. Such 
antibody reagents can be specific for a particular class of 
immunoglobulin, e.g., IgG, IgM, IgA, etc., or can be a 
mixture of conjugates so that all immunoglobulin types are 

25 detectable. Islet cell antibodies of interest are 

generally of the IgG isotype and thus anti-IgG would 
normally be employed. The solid phase is washed to remove 
unbound labelled antibody reagent and. the label activity 
remaining on the solid phase is measured qualitatively or 

30 quantitatively. 

A variation of this protocol uses a ligand-modif ied 
form of the polypeptide of the invention, and optionally a 
ligand-modif ied form of a further autoantigen with 
immobilization to the solid phase being accomplished by 

35 using a solid phase bearing a binding partner to the 

ligand. For example, biotin or a hapten (e.g., fluroscein) 
can be used as the ligand and can be rendered immobilized 
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by contact with a solid phase form of avidin or anti-hapten 
antibody, respectively. The addition of the solid phase 
binding partner can occur at any convenient time in the 
assay, such as prior to contact of sample with the ligand- 
5 antigen (s) or thereafter. 

Another immunoassay format that can be applied to the 
present method employs an immobilized form of an antibody 
reagent. Antibody specific for the desired ICA 
immunoglobulin type to be detected (e.g., IgG) , cr a 

10 mixture of antibodies against different IgG isotypes, is 
immobilized on a solid phase and incubated with the test 
sample. Resulting islet cell antibody that has become 
bound to the solid phase antibody reagent can then be 
detected in any suitable manner. For instance, one can add 

15 labelled forms of one or more types of polypeptide of the 
invention and/or of other IDDM autoantigens either as 
individually labelled reagents or as a combined labelled 
reagent. Alternatively, one can add a soluble form of the* 
polypeptide of the invention (and optionally a further 

20 autoantigen) and then, together in the form of a complex, 
or later as a separate reagent addition, labelled antibody 
that can bind with it. Further, one can add a soluble form 
of the polypeptide of the invention (and optionally a 
further autoantigen) which has been modified with a ligand 

25 and then adding a labelled form of a binding partner to the 
ligand. The previously described variation of using a 
ligand-modif ied form of the solid phase reagent, in this 
case, antibody, with immobilization to the solid phase 
being accomplished by using a solid phase bearing a binding 

30 partner to the ligand, can also be used. 

A competitive immunoassay format is also useful. 
Immobilized polypeptide of the invention is employed along 
with a labelled form of islet cell antibodies, for example 
antibodies of the invention as described herein. Labelled 

3 5 antibody and ICA from the test sample are allowed to 

compete for binding to the polypeptide of the invention 
either simultaneously or sequentially, e.g, by exposing the 
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solid phase polypeptide of the invention first to the 
sample and thereafter to the labelled antibody reagent. 
The polypeptide of the invention can be immobilized 
directly to the solid phase or through a ligand-binding 
5 partner bridge as described above, with the immobilization 
step being performed at any convenient time in the assay, 
including as the last step. 

Latex or particle agglutination methods are also 
suitable. Particles are coated or covalently coupled with 

10 the polypeptides of the invention, and optionally with one 
or more further IDDM-associated autoantigens. The 
particles are then incubated with the test sample and 
resulting agglutination of the particles due to the 
formation of ICA-antibody linkages between particles is 

15 detected. Detection can be accomplished, for example, by 
visual observation (a slide agglutination format) or 
quantitated by measuring turbidity change with a 
spectrophotometer or nephelometer . A variation is based on 
inhibition of particle agglutination. Each particle 

20 reagent comprises one or more monoclonal antibodies 

corresponding specifically with one or more particular 
polypeptide of the invention and optionally with one or 
more further IDDM-associated autoantigens, respectively. 
In addition, an agglutinator reagent is prepared comprising 

25 multiple antigens, e.g., a water soluble polymer backbone 
to which are attached multiples of one or more types of 
polypeptides of the invention, and optionally one or more 
further autoantigens. When mixed with sample, the absence 
of autoantibodies in the sample results in agglutination of 

3 0 the particles by formation of bridges formed by the 

agglutinator reagent. When autoantibodies are present, 
they bind to the agglutinator and thereby prevents or 
inhibits agglutination . 

In methods according to the invention, any suitable 

3 5 quantity of polypeptides of the invention and/or of the 
further autoantigens may be used. For example, for each 
test according to the invention, a quantity of from 
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1 x 10" 12 g to 1 x 10" 6 g of a polypeptide of the invention 
and optionally of one or more further autoantigens as 
defined herein may be used. Preferably, quantities of from 
1 x 10" xo g to 1 x 10" 7 g may be used, with quantities of from 
5 10" 9 g to 10" 7 g being more preferred. For example, 

quantities of from 10 to 50 ng, for example from 20 ng to 
30ng may be used (1 x 10" 8 g to 5 x 10" 8 g) . Such quantities 
of polypeptides of the invention and optionally of further 
autoantigens as defined herein, may, for example, be 

10 applied to a given microtiter well in a microtiter plate or 
to any other solid phase matrix. 

Diagnostic methods of the invention are typically 
performed in vitro or ex vivo although, where appropriate, 
in vivo methods are contemplated. 

15 The methods of the invention have a number cf clinical 

applications related to IDDM. For example, the methods of 
the invention are useful in screening of patients from the 
general population, particularly juveniles, wherein a 
positive test result (i.e., a finding that a patient's 

20 blood contains antibodies that bind to a polypeptide of the 
invention) indicates that such a patient is at risk of 
developing IDDM, that is, that such a patient may have a 
pre -IDDM condition. Furthermore, the present method is 
applicable to the screening of first degree relatives 

25 (i.e., siblings and children) of individuals alr€;ady 
diagnosed as having IDDM. Patients with positive test 
results from the methods of the invention could be examined 
in more detail, for example by means of metabolic and 
genetic testing, to determine whether glucose intolerance 

30 has already occurred, or whether additional risk factors 
are present. 

Other applications of the present method include the 
testing of a patieni; already diagnosed with diabetes for 
the purpose of determining if IDDM is involved, for 
35 instance in. the case of a Type II diabetic who may also 
experience the onset of IDDM. 

Furthermore, the methods of the invention are useful 
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in monitoring of diabetic therapy. For example, using the 
methods of the invention, it is possible to monitor the 
effectiveness of any type of treatment of IDDM, for example 
gene therapy, immunosuppressant therapy or immune therapy. 
5 it is also possible to monitor the effects of drug therapy, 
the impact of diet on IDDM, and the effect of other factors 
on IDDM. 

The invention further provides medical uses of 
the polypeptides of the invention. Accordingly, the 

10 invention provides polypeptides of the invention for use in 
methods of treatment of the human or animal body, 
especially the treatment of IDDM; and for use in methods of 
diagnosis of IDDM. In methods of treatment of IDDM, 
polypeptides of the invention may, for instance, be used to 

15 saturate an individual's immune response, blocking the 

individual's autoimmune response to natural IAR-PTP. The 
invention also provides polypeptides of the invention for 
use in methods of treatment of the human or animal body, 
particularly for IDDM, in combination with one or more 

20 further autoantigens as described herein. 

Further, the invention provides the use of 
polypeptides of the invention in the manufacture of 
medicaments for the treatment of the human or animal body, 
especially the treatment of IDDM; and for use in methods of 

25 diagnosis of IDDM, as described herein. Such medicaments 
may comprise a further IDDM-associated autoantigens as 
described herein, or may be administered in combination 
with such a further autoantigen. 

Similarly, the invention provides pharmaceutical 

3 0 compositions comprising a polypeptide of the invention, and 
optionally a further autoantigen as defined herein; and a 
pharmaceutical^ acceptable carrier. 

The invention further provides diagnostic test kits 
comprising polypeptides and/or antibodies of the invention, 

35 optionally together with one or more further IDDM- 
associated autoantigens, as described herein. 

Any suitable quantity of these components may be 
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present in the test kits. Other components may also be 
included in the kits, for example buffers and/or other 
solutions; labels; tubes, plates, membranes, membrane 
sticks or other items of equipment, for example for 
5 immobilising reagents; and/or anti-human antibodies for 
monitoring immunoreact ivity . 

Test kits of the invention typically further comprise 
means for determining whether or not the polypeptides of 
the invention are recognised by autoantibodies. Test kits 

10 comprising one or more further autoantigens typically 

include means for determining whether or not the further 
autoantigen or autoantigens are recognised by 
autoantibodies . 

Test kits according to the invention may, for example, 

15 enable the use of methods according to the invention. Thus, 
they may comprise, as means for determining whether or no: 
the polypeptides of the invention, and optionally one or 
more further IDDM-associated autoantigens as described 
herein, are recognised by autoantibodies, any reagents 

20 suitable for putting into practice the techniques described 
above . 

In test kits of the invention, the components may be 
in any suitable form; for example they may be packaged in 
any suitable container. The components may be intended for 

2 5 separate, for example sequential, use; or for concurrent 

use . 

In test kits of the invention, the components may be 
in any suitable form, for example they may be in sterile 
solutions, buffered solutions or sterile and buffered 
30 solutions. They may be labelled or unlabelled. The 

components may be in substantially purified, purified, 
isolated or substantially isolated form. 

The following Examples illustrate the invention. 

3 5 EXAMPLES 

1. Initial identification of JAR as a PTPase 

Primers corresponding to conserved amino acid 
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sequences (DYINA (SEQ . ID. No. 14) and VHCSAGV (SEQ. ID. 
No. 15) of the catalytic domain of PTPases were designed 
(with additional EcoRI sites added) and used to amplify a 
human colon carcinoma cell (SW480) cDNA library (Clontech, 
5 HL3014b> using PCR. The primers used had the following 

sequences : 5 ' -GGGAATTCNGAYTAYATHAAYGC- 3 ' (SEQ ID No . 3 ) and 
5 ' -GGGAATTCACNCCNGCRCTRCARTGNAC- 3 ' (SEQ ID No. 4). The PCR 
was carried out as follows: 94°C x 5 min --->(94°C x 40 
see >50°C x 30 sec >72°C x 90 sec) x 30 >72°C x 5 min- 

10 -->4°C. The major amplified fragment of ~460bp was 

subcloned into the pGEM-T vector (Promega) and 94 clones 
were sequenced. 

One clone contained an amplified fragment which did 
not have high homology to any known PTPase . This fragment 

15 was selected for further study and was termed r75 . r75 was 
identical over a 312bp portion of its sequence to a human 
brain EST sequence (# 03250) [Adams et al . , 1993, Nature 
Genet. 4, 256-267] which was noted in this paper as having 
80.6% similarity to ICA512). 

20 To examine the size and tissue expression of RNA 

transcripts from which r75 originated, human mulciple 
Northern blots (Clontech) were probed with 32 P- labelled r75 
(Fig. 1). Two major transcripts of about 3 . 7kb and 5 . 5kb 
were observed. The highest levels of expression were 

25 detected in brain and pancreas, followed by prostate and 
testis, with lower expression detectable in a few other 
tissues, including colon. 



2 . Cloning of IAR 

3 0 Based on the expression pattern of transcripts 

recognized by r75, a random primed human pancreas cDNA 
library in Xgtll (Clontech, HL1163b) was screened using r75 
as probe. The r75 was labelled with 32 P-dCTP (Amersham) 
using che High Prime Labeling Kit (Boehringer Mannheim) and 

35 denatured by heating before hybridization. Hybridization 
was carried out overnight at 65°C in 0 . 5M sodium phosphate 
(pH 6.8), 7% SDS, 15% formamide, ImM EDTA. The membrane 
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was washed 3 times at 65°C with 50mM sodium phosphate (pH 
6.8), 0.1% SDS, and exposed to film at -70°C. Positive 
clones were reselected through another 3 rounds of 
subscreening with r75. Continuously positive clones were 
5 subcloned into the EcoRI site of Bluescript SK( + ) 

(Stratagene) and partially sequenced for comparison with 
r75. Some clones were found to be fragments of IA-2 cDNA 
(Lan et al . 1994). One of these clones, D2, contained 
nucleotides 1589-3166 of IA-2. Several other clones were 

10 distinct from IA-2. Of these, the largest clone (C3) 
containing an open reading frame of about 2.5kb was 
selected for sequencing. The C3 clone was digested by Pst;I 
or Apal into shorter fragments. Each fragment was 
subcloned into Bluescript SK(+) and sequenced across both 

15 strands using either primers to the vector or synthesized 
primers hybridizing to the insert in conjunction with the 
Sequenase Version 2.0 Sequencing Kit (United States 
Biochemical Corporation) . 

The C3 clone contained a coding sequence missing 

20 initiation and termination signals, and was significantly 
smaller than than the 3.7 and 5 . 5k±> RNA transcripts 
detected by Northern blotting with r75 (Fig.l), suggesting 
that it was incomplete at both the 5' and 3' ends. The 3' 
end of the clone was obtained by 3' RACE using the Marathon 

25 cDNA Amplification Kit (Clontech) . The first-strand cDNA 
was reverse transcribed from human brain polyA* mRNA 
(Clontech) using the cDNA synthesis primer provided with 
the kit. The gene-specific primer used for 3 ' -RACE was 5'- 
CCTGCCTCCTCAGGCGGAGCAAGA-3 ' (SEQ ID No. 9), corresponding 

30 to nucleotides 1389-1412 of SEQ ID No.l. PCR was; carried 

out at 94°C x 3 min >(94°C x 40 sec >68°C x 4 min) x 30- 

-->4°C using the Expand Long PCR Kit (Boehringer Mannheim) . 
The amplified fragments were cloned into the pGEM-T vector 
(Promega) and partially sequenced for comparison with C3 . 

35 One clone, 3-7, with a region of identical overlap with C3 , 
contained a stop codon and polyadenylation signal and was 
sequenced along both strands by Sanger's dideoxy -mediated 
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chain- termination method. Another clone, 3-10, was about 
2kb larger than 3-7. The 3-10 clone was identical to 3-7 
from the 5' end up to the polyadenylation signal in 3-7, 
but was followed by approximately 2kb of additional 3' 
5 untranslated sequence. 

The combined sequences of C3 and 3-7 comprise a 
partial cDNA encoding a novel putative receptor-like PTPase 
which we term 1AR (for islet cell antigen-related) (Fig. 2). 
The predicted protein fragment (SEQ. ID. No. 2) translated 

10 by the only possible open reading frame comprises an 

incomplete N-terminal extracellular region, a 21 amino acid 
transmembrane -spanning region, and a 37 9 amino acid 
intracellular region. 

To obtain the 5' end of the cDNA, the pancreas library 

15 was rescreened with a 300 bp probe close to the 5' end of 
C3 . No IA-2 clones were found. Of the positive clones, 
clone Bll appeared to have the longest 5' extension to C3 
and was selected for further sequencing along both strands. 
The 3' end of Bll was identical to C3 sequence over a 

20 length of about 2 kb, but the 5' ends of Bll and C3 were 
different (Fig. 2C) . Bll had a unique 5' sequence of 412 
bp, followed by a 54 bp sequence that is found in the 
opposite orientation in C3 . The sequence of C3 found 5' to 
this inverted region was shorter and different from that of 

25 Bll. 5' RACE was used to determine which clone contained 
the correct 5' sequence and to try to obtain additional 5' 
sequence to the open reading frames of Bll and C3 . 
Template cDNA was prepared from human brain polyA* mRNA 
(using the Marathon cDNA amplification Kit (Clontech) using 

30 random primers or a primer (P5, with the sequence 5'- 

CGTGTGGGCCACATAGGTCAGGATGCTCTCGGAGAA- 3 ' (SEQ . ID . No . 20)) 
corresponding to shared B11/C3 sequence located about 170 
bp 3' to the invert. The 5' RACE used a primer to the 
ligated adaptor sequence at the ends of the cDNA and an IAR 

35 specific reverse primer (Pll with the sequence 

5 ' -TGGCGAGCACGTCTGAGGCTG- 3 ' (SEQ. ID. No. 21)) to a region 
of shared B11/C3 sequence located about 90 bp 3 ' to the 
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invert. The amplified fragments were cloned inco pGEM-T 
(Promega) and sequenced along both strands. Cloning and 
sequencing of the 5' RACE PCR products from both types of 
template cDNA showed that all had Bll sequence. Similarly, 
5 PCR amplification of the pancreas cDNA library using a 
primer to Xgtll and either P5 or Pll primers gave DNA 
fragments which all corresponded to the unique Bll 
sequence. Thus the Bll sequence is correct and the 5' end 
of the C3 clone is likely a library artifact. No PCR 

10 products were found with a 5' extension of the Bll 

sequence, and no 5' -extended clones were identified by 
further screening of human brain and pancreas cDNA 
libraries. f 
Although no in- frame stop codons were found 5' to the 

15 first ATG codon in the Bll sequence, we propose chat this 
is the initiation codon for the following reasons: 1) a 
purine (G) in the -3 position conforms with the Xozak rules 
of initiation (Kozak (1989)), and 2) this methionine is 
followed by a 20 residue hydrophobic sequence that has the 

20 features of a signal sequence (Von Heijne 1986) ) . To test 
for signal sequence function, cDNA encoding full-length IAR 
protein was transcribed and translated in vitro in the 
presence of microsomal membranes. Subsequent treatment 
with Proteinase K reduced the size of the translated 

25 product- to a fragment corresponding to the predicted size 

of the extracellular region of IAR (data not shown) . This / 
suggests that the IAR protein is inserted into the 
membranes such that the extracellular region of IAR is in 
the Proteinase K-inaccessible interior of the membrane 

30 vesicles. 

The complete cDNA (SEQ. ID. No. 10) is predicted to 
encode a receptor- like PTP of 1015 amino acids (SEQ. ID. 
No. 11) . This includes a 614 amino acid extracellular 
region, a 2 5 residue hydrophobic transmembrane segment, a.nd 
35 a 376 amino acid intracellular region containing a single 
PTP catalytic domain (SEQ. ID. No. 11). This protein ha=: 
about 43% overall sequence identity to the putative PTP IA- 
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2 (Lan et al (1994); Rabin et al (1994)), a protein 
identified as an islet cell auto-antigen in IDDM, and 
another PTP-like molecule called PTPLP ( (Kambayashi et al 
(1995)). We have thus named the novel PTP as islet cell 
5 antigen-related PTP or IAR PTP. The IAR extracellular 

region does not contain FN- III or Ig-like repeats, but has 
the adhesion recognition peptide sequence RDGS (amino acids 
372-375) . There is one potential site for N-linked 
glycosylation . The catalytic domain of IAR has conserved 

10 sequences typical of other PTPases, although the active 

site is unusual in having an aspartate residue at position 
947 ( I VHCSDGAGRTG (SEQ. ID. No. 16)) in place of a 
conserved alanine residue. 

In view of the homology of IAR and IA-2, we selected 

15 non- homologous fragments of the C3 and D2 clones to use as 
specific probes on Northern blots in order to determine 
whether distinct transcripts were identified. The IAR probe 
comprised nucleotides 790-1197 of SEQ. ID. No.l 
(corresponding to a part of the predicted extracellular 

20 region) , and the IA-2 probe comprised nucleotides 1665-1906 
of IA-2 cDNA (Lan et al . , 1994), corresponding to a part of 
the extracellular region, the transmembrane region, and a 
small portion of the intracellular region. These probes 
.were labelled with 32 P as described in the legends to Figs. 

25 3 and 4. A set of human multiple tissue Northern blots 
(Clontech) was probed first with the IA-2 probe (Fig. 3), 
and later stripped and reprobed with the IAR probe (Fig. 
4). The two probes recognize . distinct transcripts in terms 
of size and tissue specificity. The IA-2 probe detects a 

30 single transcript of about 3 . 9kb (Lan et al . , 1994), 

whereas the IAR probe detects two transcripts of about 3.7 
and 5.5kb. It is possible that the 3.7kb transcript has a 
3' untranslated sequence corresponding to that of the 3-7 
clone, while the larger 5.5kb transcript represents an 

35 alternative 3' untranslated sequence corresponding to that 
in the longer 3-10 clone. The highest expression of IA-2 
is detected in brain, followed by spinal cord and pancreas, 
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with low levels expressed in small intescine and adrenal 
gland. The highest expression of IAR is in pancreas and 
brain, followed by trachea, prostate, stomach and spinal 
cord, with low levels detectable in small intestine and 
5 adrenal gland. Figure 5 shows an alignment of the IAR and 
IA-2 transcripts detected in brain and pancreas. It is 
obvious that the IA-2 transcript is larger than the 
smallest 3 . 7kb IAR transcript, and furthermore the 3 . 7kb 
IAR transcript is much more highly expressed in pancreas 
10 than brain while the 3 . 9kb IA-2 transcript is more highly 

expressed in brain than pancreas. Thus IAR and IA-2 appear 
to be encoded by distinct genes. 

3 . Expression and analysis of the intracellular region of 
15 IAR 

The intracellular region of IAR was amplified by PCR 
using the 3-7 clone as template. The primers used had the 
sequences 5 ' - GGGCTCGAGTCTAGACAGGCTGAAGGAGAAGCTCTC - 3 ' (SEQ 
ID No. 5) (corresponding to nucleotides 1517-1538 of SEQ. 

20 ID. No.l, with added Xbal and Xhol sites) and 5'- 
GGGGAATTCCATGGTTATAATAGAAGACACACA-3 ' (SEQ ID No. (i) 
(corresponding to nucleotides 2704-2723 of SEQ. ID. No.l, 
with added EcoRI and Ncol sites) . PCR was carri€id out at 
94°C x 5 min >(94°C x 40 sec >60°C x 30 sec >72°C x 2 

25 min) x 10--->72°C x 5 min >4°C using the Expand Long PCR 

Kit (Boehringer Mannheim) . The amplified fragment was 
cloned into the Xbal and Ncol sites of pGEX-KG (Guan and 
Dixon r 1991) and sequencing of the entire insert confirmed 
that no mutations had resulted from PCR. Later, the insert 

3 0 was cut out of this plasmid with Xhol and EcoRI and cloned 
into these sites in pGEX-3C (Walker et al . , 1994). 
Mutant IAR (C945S) was generated by PCR using 
overlapping forward and reverse primers corresponding to 
the desired mutation and surrounding sequence encoding 

3 5 IIVHSSDGA (SEQ. ID. No. 19) ; 

5 ' - ATAATTGTTCATTCCAGTGACGGTGCA- 3 ' ( SEQ . ID. No . 17) and 
5 ' - TG C AC CG T C ACTGG AATG AAC AATT AT - 3 ' ( SEQ . ID. No. 18). Each 
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primer was used in combination with one of the primers (of 
SEQ. ID. No. 5 or 6) {used to amplify the intracellular 
reaion of IAR) to generate two overlapping DNA fragments 
corresponding to nucleotides 1993-2904 and 2878-3197 of 
5 IAR. These fragments were mixed and used as template for a 
?CR reaction with the primers of SEQ. ID. No. 5 and 6. 
The above plasmids were transformed into E.coli 
(DH5aF')< The recombinant bacterial host was cultured and 
GST- IAR expression induced with 0.15mM IPTG. The . GST- IAR 

10 was purified from bacterial lysates by standard methods 
(Smith and Johnson, 1988). The IAR was cleaved from GST 
using thrombin or 3C protease as appropriate (Smith and 
Johnson, 1988; Walker et a J . , 1994), and the purified IAR 
quantitated by SDS-PAGE followed by densitometric scanning 

15 alongside known amounts of standards as described (Wang and 
Pallen, 1991) . Cleavage with thrombin of GST- IAR produced 
from the pGEX-KG plasmid resulted in a major protein band 
on SDS-PAGE which migrated with an apparent size of about 
3 6kD (Fig. 6A, lane 4) . This was smaller than the 

20 predicted size of the expressed IAR insert, suggesting that 
internal cleavage of the IAR polypeptide by thrombin might 
have occurred. This led to recloning of the insert into 
the pGEX-3C vector as described above. Cleavage with 3C 
protease of GST- IAR produced from this plasmid resulted in 

25. a major protein band on SDS-PAGE which migrated with the 
expected size of about 40kD (Fig.6A, lane 3). 
The purified 40kD form of the IAR intracellular region was 
analyzed for phosphatase activity. Low but detectable 
phosphatase activity towards the substrate para-nitrophenyl 

30 phosphate was measured (Fig. 6B) . Since the related 
molecule IA-2 reportedly has no detectable phosphatase 
activity (Rabin et al . , 1994; Lu et al . , 1994), we examined 
this in side-by-side experiments with IAR . 

35 4. Expression and analysis of the intracellular region of 
IA-2 

The intracellular region of IA-2 was amplified by PCR 
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using the D2 clone as template. The primers used had the 
sequences 5 ' -GGGCTCGAGTCTAGAGCGGCAGCAAGACAAGGAG- 3 ' (SEQ ID 
No. 7) (corresponding to nucleotides 1885-1903 of IA-2 (Lan 
et slI . , 1994), with added Xbal and Xhol sites) and 
5 5 ' -GGGGAATTCGAGCTCGCATGCCCAAGAGGTGGC- 3 ' ( SEQ ID No. 8 ) 

(corresponding to nucleotides 3104-3121 of IA-2, with added 
EcoRI and SacI sites) . PCR was carried out as described 
above for the IAR intracellular region. The amplified 
fragment was cloned into the Xbal and SacI sites of pGEX-KG 

10 (Guan and Dixon, 1991) . Later, the insert was cut out of 
this plasmid with Xhol and EcoRI and cloned into these 
sites in pGEX-3C (Walker et al . , 1994). 

Recombinant bacterial GST- IA-2 protein was produced 
from both the above plasmids, cleaved, purified and 

15 quantitated as described above for IAR. However, unlike 
IAR, the type of protease used for cleavage from GST did 
not affect the size of the IA-2 produced, with the purified 
IA-2 migrating on SDS-PAGE in both cases as a major band 
with an apparent size of 44kD (Fig.6A, lanes 1 and 2). 

20 Thus the IAR and IA-2 proteins differ in that IAR appears 
to possess an internal thrombin cleavage site which is 
absent in IA-2. 

5. Catalytic activity of IAR 

25 The purified 44kD form of IA-2 (produced by cleavage 

with 3C protease) possessed phosphatase activity towards 
para-nitrophenyl phosphate which was roughly comparable to 
that of IAR (Fig. 6C) . 

The intracellular region of IAR (amino acids 646-101.5) 

30 was expressed as a GST-fusion protein and purified 

following 3C protease cleavage from GST. The IAR migrated 
on SDS-PAGE as a major protein band of about 41 kDa (Fig. 
7A, lane 2), in accord with the predicted size of 41.7 kDa. 
The purified IAR possessed low but detectable phosphatase 

3 5 activity towards pNPP . The IAR is active over a narrow pH 
range, with optimal activity at pH 4.5, and is essentially 
inactive at pH 5.5 (Fig.7B). IAR catalyzes the cime- 
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dependent dephosphory lat ion of pNPP (Fig. 7C) with a 
specific activity of 21 nmol/min/mg. Like many PTPs, IA3 
activity is sensitive to inhibition by vanadate, and lmM 
sodium orthovanadate completely abolished activity (Fig. 
5 7C) . Site-directed mutagenesis of the essential cysteine 
residue (to a serine) in the active site resulted in the 
expression of an IAR intracellular region polypeptide which 
did not catalyze pNPP hydrolysis (Fig. 7C) , indicating that 
the phosphatase activity measured with the wild type 
10 polypeptide was unlikely to be due to contaminants in the 
protein preparation. IAR dephosphorylated phosphotyrosy 1 
casein but not phosphoseryl casein. 



6. Reactivity of IAR with IDDM Patient Sera 

15 

Patients and Sera. Sera were obtained from healthy 
blood bank controls (n«10) and from recent -onset IDDM 
patients (n=20, age 4.0-15.7 years) prior to commencement 
of insulin therapy. ICA were measured by indirect 

20 immunofluorescence on frozen sections of human pancreas 

(Bonifacio et aJ (1990) ) , and GAD antibodies were measured 
by enzymatic immunoprecipitation (Schmidli et al (1994)). 

In Vitro Translation and Immunoprecipitation Tests 
with Patient Sera- cDNAs encoding amino acids 64 6-1015 of 

25 IAR (representing the intracellular region) or full-length 
IA-2 protein (Lan et al (1994)) were subcloned into the 
pGEM-3 vector, under the control of the SP-6 promoter. 
Radiolabeled IAR and IA-2 proteins were synthesized by in 
vitro translation and transcription in the presence of 3S S- 

30 methionine, using a reticulocyte lysate system (Promega TNT 
Kit) . For immunoprecipitations, l/il of lysate containing 
labelled IAR or IA-2 in 100/il TBST-BSA (50mM Tris-HCI, 
150mM NaCl, 1% Tween-20 # 0.1% BSA) was incubated cvernigh: 
at 4 °C with 2/il of serum which had been prebound to lO/il 

3 5 (packed bead volume) Protein A-Sepharose (Pharmacia) . 

Beads were washed once with TBST-BSA, once with high salt 
wash buffer (0.5% Triton X-100, 200mM NaCl , 50mM NaH.POj , 
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:wice in PBS plus 0.5% Triton X-100 and once in 0.5% Triton 
X-1C0, 4 . 17mM Tris, 192mM glycine, pH 7.4. Precipitated 
proteins were resolved by 10% SDS-PAGE. Gels were dried 
and exposed to a phosphor imager , and bands were quantified 
5 by densitometry after subtraction of local background. 

Sera from 10 healthy blood bank control and 20 recent-onset 
IDDM subjects (ICA, GAD antibodies, age and sex of subjects 
are shown in Table 1) were incubated with in vitro 
translated 3S S-methionine labelled IAR (amino acids 646- 

10 1015). Immunoprecipitates were analyzed by SDS-PAGE, where 
the translated IAR protein appeared as a band of 41 kDa 
with an associated proteolytic breakdown product of 32 kDa. 
Representative immunoprecipitat ions of IDDM and control 
sera are shown in Fig. 8A. Quantitative analysis of a 

15 panel of sera showed that band densities were greater than 
the mean +2SD of the 10 control sera (1818 density units) 
in 11 of the 20 (55%) recent-onset IDDM sera (Table 1) . 
There was no significant correlation, in this small number 
of subjects, between IAR and ICA, GAD antibodies, or age. 

2 0 The IAR reactivity of the control and IDDM subjects is 

shown graphically in Fig. 8B . Sera from 9 of the 11 IAR- 
reactive patients also immunoprecipitated IA-2, some 
reacting more strongly with IAR than IA-2 and the inverse 
seen with others (Table 1) . Of interest is the finding 

25 that 3 IDDM subjects had antibodies which 

immunoprecipitated IAR or IA-2, but not both (Table 1, 
subjects 4, 13, 20). Thus IAR and IA-2 can distinguish 
different autoantibody populations in a subset of IDDM 
patients . 
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Table 1. Reactivity of sera with ICA, GAD, IAR and IA-2 



Patient 


Age 


Sex 


ICA 


GAD 


IAR 


IA-2 


Control 










- 1039 


- 1198 


2 










-8 


366 


3 










734 


-350 


4 










1611 


122 


5 










58 


-2665 


6 










412 


- 129 


7 










64 2 


*7 A O 
/4 b 


8 










388 


66 9 


9 










*T C C 


- o ^ 


10 














IDDM 1 


12.1 


F 


0 


209 


1092 


508 


2 


5.5 


M 


160 


522 


1084 


-1326 


3 


6.0 


M 


140 


180 


312286 


13491. 


4 


13 .1 


F 


42 


272 


7963 


318. 


5 


NA 


NA 


NA 


NA 


2148 


12990', 


6 


4 . 0 


F 


270 


3291 


365 


170 


7 


12 . 9 


F 


0 


238 


428 


6 91- 


3 


13 . 4 


M 


84 


184 


376 


470 


9 


12 . 8 




270 


3335 


21802 


18732, 


10 


10. 7 


M 


63 


283 


5392 


5626 


11 


12 . 5 


p 


32 


1713 


631 


-567 


12 


NA 


NA 


NA 


NA 


150105 


16493 


13 


9 . 8 


F 


120 


839 


4344 


-140 


14 


10.5 


M 


270 


167 


17107 


37610 


1 5 


3 . 8 


C 

r 




a a ~\ 




i ni As 

lUlOO 


16 


7 . 0 


M 


270 


199 


160 


-1365 


17 


NA 


NA 


NA 


NA 


19517 


11265 


18 


4 .6 


M 


270 


126 


21006 


12713 


19 


9.6 


M 


26 


235 


1232 


1726 


20 


15 . 7 


F 


160 


1959 


-378 


4717 



10 



15 



20 



25 



30 



35 



8- Comparison of the predictive value of IAR and IA-2 
antibodies for IDDM (see Figures 9 and 10) 

Sera from recent -onset IDDM subjects (n=53) were 
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obtained piror to commencement: of insulin therapy. The 
healthy control group (n=144) consisted of laboratory 
volunteers and healthy schoolchildren. First -degree 
relatives of subjects with IDDM (n=3315) were enrolled in 
5 the Melbourne Prediabetes Family Study and had ICA and IAA 
measured at entry. These subjects form an "at-risk" group 
in that the risk of developing IDDM in ICA positive 
relatives has been previously shown to be 3 5% in five years 
and 60-70% in 10 years (reviewed in Bingley et al (1993) . 

10 As an alternative to population screening, we have studied 
this well-defined at risk-group of subjects who have been 
prospectively followed, to compare the predictive power of 
IAR Ab and IA-2 Ab. 

Measurement of IAR Ab and IA-2 Ab in these sera has 

15 allowed us direct comparison of the abilities of these 

assays to predict IDDM, using identical assay formats and 
standardised reference ranges. In total, 33 of these 
subjects have developed IDDM. IAR and IA-2 antibodies were 
measured in 115 subjects with ICA >. 20 and/or IAA > 100, 

20 who have been followed for 3.52 years (mean, range 0.30- 
7.28 years). Of this group, 18 progressed to IDCM during 
follow-up over 2.28 years (range 0.3-4.52 years). 

IAR and IA-2 antibody radiobixiding assays 

25 

Coupled in vitro transcription and translation was 
used to synthesise 35 S- labelled protein, which was then 
immunoprecipitated with sera using a modification of the 
method of Bonifacio et al (1995) . Complementary DNA (cDNA) 

30 encoding full length human IAR was cloned into the 

pBluescript vector under control of the T3 promoter; cDNA 
encoding amino acids 646-1015 of IAR, representing the 
intracellular region was inserted into the pGEM3 vector 
under control of the SP6 promoter and cDNA encoding full- 

3 5 length IA-2 was inserted into the pSP64 poly A vector under 
control of the SP6 promoter. Recombinant protein was 
synthesised using the Promega TNT reticulocyte lysate kit 
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(Promega, Madison, Wis., USA), in the presence of ib S 
methionine, according to the manufacturers' instructions, 
using the appropriate RNA polymerase. Two fig DNA template 
was used in all in vitro translation and transcription 
5 reactions, which proceeded for two hours at 3 0°C. 

The transcription/translation product (50^1) was 
diluted into 12ml TBST-BSA buffer (50mM Tris-HCl pH 7.2, 
ISOmM NaCl, 1% Tween 20, ImM L-methionine, 0.1% BSA) , and 
50^1 of this dilute lysate was added to Sfxl test serum and 

10 incubated overnight at 4°C in a 96 -well plate (well volune 
2ml, Beckman, Palo Alto, CA, USA). Fifty fil of protein A 
Sepharose suspension (50mg/ml dry weight) in TBST-BSA was 
added and the plate was shaken on a plate shaker for Ih at 
4°C. Beads were washed x3 with 1ml TBST-BSA in the plate. 

15 Immunoprecipitates were transferred to a 96-well Unifilter 
plate (Packard, meriden, CT, USA) , using a multichannel 
pipette, in 100/xl TBST-BSA. Scintillation fluid (30^1 
Microscint 40, Packard) was added to the wells and 
radioactivity was counted in a TopCount Microplate 

20 Scintillation Counter (Packard) . Results were expressed as 
arbitrary units, calculated using the formula: 

Results ((sample - negative control) / (positive control - 
negative control) ) . 

25 

The negative control consisted of pooled normal serum 
and the positive control was strongly IAR and IA-2 positive 
plasmapheresis fluid from a patient with stiff-man syndrome 
and diabetes. The inter assay coefficient of variation of 
30 the IAR antibody assay was 5.8% for a sample with a mean 

value of 57.3 units, and 12.5% for the IA-2 Ab assay, using 
a sample with a mean value of 96.8 units. 

ICA and IAA assays 

35 

ICA were measured by indirect immunofluorescence on 
frozen sections of blood group O human pancreas (Bonifacio 
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et al (1990)) and assigned a value in JDF unics by dilution 
and comparison with standard sera. This assay has been 
standardised in all Immunology of Diabetes and Proficiency 
Workshops. Insulin autoantibodies were measured by a 
5 competitive fluid phase radiobinding assay (Vardi et al 
(1989) ) . 

Intravenous glucose tolerance test (IVGTT) 



10 The Islet Cell Antibody Registry Users Study (ICARUS) 

recommended protocol was used for the IVGTT, as previously 
described {McNair et al (1995)). Serum insulin was 
measured using a commercial RIA kit (Pharmacia, Uppsala, 
Sweden) . 

15 

Statistical analysis 



Associations between parameters were assessed using 
regression analysis. To standardise the specif icty of 

20 antibody assays for comparison of sensitivity, the 

threshold for positivity of each assay was adjusted to 
derive a specificity of 97-98%. This level of specificity 
was chosen because it detects a sufficiently small number 
of false positives to be useful in population screening. 

25 The effect of IAR and IA-2 antibody positivity on IDDM-free 
survival was analysed using the log-rank test. 

Results 



30 Before comparing IAR Ab with IA-2 Ab, the partial 

(intracellular domain) and full-length IAR clones were 
compared for recognition by IDDM sera, to determine whether 
additional reactivity was present on the extracellular 
domain of the full-length IAR molecule that was not present 

35 in the intracellular (cytoplasmic) domain. Then: was a 

strong relationship between levels of antibodies to full- 
length IAR, and antibodies to the intracellular domain 
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ir=94.9%, p<0.0001). Use cf a standardised specificity of 
97.9% resulted in a sensitivity of 58.1% for antibodies to 
the cytoplasmic domain and 48.8% for antibodies to the 
full-length molecule. Because of the strong relationship 
5 between antibody levels and the similar assay 

sensitivities, the intracellular fragment was used as the 
antigen in the remaining experiments. 

IAR Ab and IA-2 Ab were measured in S3 recent -onset 
IDDM and 144 healthy control subjects. The reference range 

10 for each assay was adjusted to give a specif if city of 97.9% 
for each assay, which was 0.5 units for IAR Ab and 3.0 
units for IA-2 Ab. The corresponding sensitivities were 
56.6% for IAR Ab and 62.3% for IA-2 Ab (Table 2) . 
Sensitivities and specificities of combinations of IAR Ab 

15 and IA-2 Ab, and IAR Ab or IA-2 Ab, using the reference 
ranges described above, are also shown in Table 2. The 
presence of either antibody resulted in a sensitivity of 
73.6% and a specificity of 96.5%, and the presence of both 
antibodies gave a specificity of 100% and a sensitivity of 

20 45.3%. Although levels of IAR Ab and IA-2 Ab were 

correlated in recent -onset subjects (r=0.7, p<0.0001, 
Figure 9) and there was a degree of overlap, some patients 
were positive for one antibody only. Thus, six (11.3%) 
were positive for IAR Ab and negative for IA-2 Ab, and nine 

25 (17%) were positive for IA-2 Ab and negative for IAR Ab. 

In some cases there were marked differences between levels 
of IAR Ab and IA-2 Ab, for example there were six sera with 
IA-2 Ab levels greater than 50 units and IAR Ab less than 5 
units, and one with IAR Ab of 25 units and IA-2 Ab of 2.9 

30 units. 

We next examined the ability of IAR Ab and IA-2 to 
discriminate between IDDM progression and non-progression, 
in our prospective study of first-degree relatives with ICA 
> 20JDFU or IAA > lOOnU/ml. IAR and IA-2 antibody 
35 positivity and combinations of these antibodies are 

summarised in Table 2. Of the 18 subjects who progressed 
co IDDM, IAR Ab were detectable in nine (50.0%) and IA-2 Ab 
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in 10 (55.6%) 2.6 + 1.8. (mean, range 0.3-6.3) years prior :o 
diagnosis, using the threshold for posit ivity defined 
above. Of the 97 nonprogressors , only 21 (21.6%) had IAR 
antibodies, whereas 31 (31.9%) had IA-2 antibodies. The 
5 difference in IDDM-free survival between IAR Ab positive 
and negative subjects was statistically significant 
(p<0.001 log-rank test), whereas IA-2 Ab positivity did not 
significantly influence IDDM-free survival. The 
combination of either IAR or IA-2 antibodies did not 

10 significantly affect IDDM-free survival, but the presence 
of both antibodies had a significant effect (p<0.0005). 
Although IA-2 Ab positivity did not affect IDDM-free 
survival when these standardised thresholds of positivity 
were used, the use of higher thresholds for IA-2 Ab 

15 positivity resulted in significant differences. For 

example, when nine units was used as the threshold, IA-2 Ab 
positivity was significantly associated with progression to 
IDDM (p<0.0005, log-rank test). 

In addition to predicting the development of IDDM, IAR 

20 Ab were associated with markers of high risk of IDDM, in 

relatives with ICA or IAA. Insulin autoantibodies (Ziegler 
et ai (1989)) and impaired intravenous glucose tolerance 
(Vardi et al (1991) ) have both been previously shown to be 
associated with progression to IDDM in ICA positive first- 

25 degree relatives. IAA were significantly correlated with 
IAR antibodies (r=0.25, p=0.005) but not with IA-2 
antibodies (r=0.11, N.S.). Both IAR and IA-2 antibodies 
were negatively correlated with the sum of the 1 and 3 
minute insulin levels during an intravenous glucose 

30 tolerance test (IAR: r=0.27, p<0.01; IA-2: r=0.25, p<0.05). 
In addition to these relationships, both IAR and IA-2 
antibodies were strongly correlated with ICA levels (IAR: 
r=0.31, p<0.001, IA-2 : r=0 . 38) . 

There were some subjects in whom there were marked 

35 differences in the level of each antibody. Furthermore, in 
our group of high-risk ICA or IAA positive first degree 
relatives, the presence of IAR Ab was associated with 
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progression :o IDDM whereas TA-2 Ab was not.. Neither 
combinations of IAR Ab or IA-2 Ab, nor IAR Ab and I A- 2 Air- 
resulted in better prediction of IDDM than IAR Ab alone in 
the prospective study; thus,, combined measurement of PTP 
5 antibodies does not appear to be preferable to measurement 
of IAR Ab alone in IDDM screening programmes. In addition 
to discriminating subjects who progressed to IDDM, IAR Ab 
were correlated with other markers of progression to IDDM, 
namely IAA and impaired first-phase insulin release. These 
10 results provide evidence that IAR Ab are markers of P cell 
destruction . 

Levels of antibodies against the cytoplasmic domain of 
IAR were closely correlated with those against the full- 
length clone, and sensitivities of both assays were 

15 similar, suggesting that antibodies from IDDM patients 
recognise the cytoplasmic domain of IAR. 

In conclusion, we have demonstrated that IAR Ab have 
similar diagnostic sensitivity for recent-onset IDDM to IA- 
2 Ab, but are better predictors of IDDM in our high-risk 

20 subjects. As preclinical intervention therapies in IDDM 
are critically reliant on effective screening tests to 
define high-risk subjects, IAR Ab may well have an 
important role in IDDM screening, either alone or in 
combination with other humoral markers of IDDM. 

25 
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TABLE 2 



IAR Ab IA-2 Ab. IAR Ab or IAR Ab 

or & 
IA-2 Ah* IA-2 Ab 



10 Positivity threshold 

(units) <0 . 5 <3 . 0 
Recent-onset IDDM 

Sensitivity {%) 56.6 62.3 73.6 45.3 

Specificity (%) 97.9 97.9 96.5 100.0 

15 ICA positive 

first- degree relatives 

% Progressors positive 55.6 50.0 61.1 44.4 

% Nonprogressors positive 21.6 32.0 39,2. 13.4 

pdogrank test) <0.001 N.S. N.S 0.0005 

20 ■ 



Table 2 Comparison of diagnostic sensitivity and predictive 
power for IDDM, of IAR Ab, IA-2 Ab and combinations of IAR 
Ab and/or IA-2 Ab . 
25 Thresholds for positivity described in text 

/ 
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SEQUENCE INFORMATION 

SEQ - ID. Nos. 1 and 2: partial cDNA sequence of IAR-PTP (SEQ ID 
NO. 1) and encoded amino acid sequence (SEQ ID NO. 2) . 

AGT CGG GAG GGC GGT GCT GCC CTG GCC AAC GCC CTC CGA CGC CAC CTG 

Ser Arg Glu Gly Gly Ala Ala Leu Ala Asn Ala Leu Arg Arg His Leu 
5 10 15 

CCC TTC CTG GAG GCC CTG TCC CAG GCC CCA GCC TCA GAC GTG CTC GCC 

Pro Phe Leu Glu Ala Leu Ser Gin Ala Pro Ala Ser Asp Val Leu Ala 
2 0 25 3 0 

AGG ACC CAT ACG GCG CAG GAC AGA CCC CCC GCT GAG GGT GAT GAC CGC 

Arg Thr His Thr Ala Gin Asp Arg Pro Pro Ala Glu Gly Asp Asp Arc 
2b 40 45 

TTC TCC GAG AGC ATC CTG ACC TAT GTG GCC CAC ACG TCT GCG CTG ACC 

Phe Ser Glu Ser. lie Leu Thr Tyr Val Ala His Thr Ser Ala Leu rhr 
50 55 60 

TAC CCT CCC GGG TCC CGG ACC CAG CTC CGC GAG GAC CTC CTG CCG CGG 

Tyr Pro Pro Glv Ser Arg Thr Gin Leu Arg Glu Asp Leu Leu Pro Arg 
65 "70 75 80 

ACC CTC GGC CAG CTC CAG CCA GAT GAG CTC AGC CCT AAG GTG GAC AGT 

Thr Leu Glv Gin Leu Gin Pro Asp Glu Leu Ser Pro Lys Val Asp Ser 
85 90 95 

GGT GTG GAC AGA CAC CAT CTG ATG GCG GCC CTC AGT GCC TAT GCT GCC 

Glv Val Asp Arg His His Leu Met Ala Ala Leu Ser Ala Tyr Ala Ala 
100 105 110 

CAG AGG CCC CCA GCT CCC CCC GGG GAG GGC AGC CTG GAG CCA CAG TAC 

Gin Arg Pro Pro Ala Pro Pro Gly Glu Gly Ser Leu Glu Pro Gin Tyr 
115 120 125 

CTT CTG CGT GCA CCC TCA AGA ATG CCC AGG CCT TTG CTG GCA CCA GCC 

Leu Leu Arg Ala Pro Ser Arg Met Pro Arg Pro Leu Leu Ala Pro Ala 
130 135 140 

GCC CCC CAG AAG TGG CCT TCA CCT CTG GGA GAT TCC GAA GAC CCC TCC 

SUBSTITUTE SHEET (RULE 26) 
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Ala Pro Gin Lys Trp Pro Ser 160 



Pro Leu Gly Asp Sor Glu Asp 1-ro Sei 



155 



AGC ACA GGC GAT GGA GCA CGG ATT CAT ACC CTC CTG AAG GAC CTG CAG 
Ser Thr Gly Asp Gly Ala Arg He His^Thr Leu Leu Lys Asp^Leu Gin 



165 



AGG CAG CCG GCT GAG GTG AGG GGC CTG ACT GGC CTG GAG CTG GAC GGC 

Arc Gin Pro Ala Glu Val Arg Gly Leu Ser Gly Leu Glu Leu Asp Gly 

180 185 
ATG GCT GAG CTG ATG GCT GGC CTG ATG CAA GGC GTG GAC CAT GGA GTA 

Met Ala Glu Leu Met Ala Gly Leu Met Gin Gly Val Asp His Gly Val 
!95 200 2Cb 

GCT CGA GGC AGC CCT GGG AGA GCG GCC CTG GGA GAG TCT GGA GAA CAG 
Ala Arc Gly Ser Pro Gly Arg Ala Ala Leu Gly Glu Ser Gly Glu Gin 

210 " 215 "° 

GCG GAT GGC CCC AAG GCC ACC CTC CGT GGA GAC AGC TTT CCA GAT GAC 

Ala Asu Gly Pro Lys Ala Thr Leu Arg Gly Asp Ser Phe Pro Asp Asp 
225 230 235 

GGA GTG CAG GAC GAC GAT GAT AGA CTT TAC CAA GAG GTC CAT CGT CTG 

Gly Val Gin Asp^sp Asp Asp Arg Leujyr Gin Glu Val Hi^Arg Leu 

ACT GCC ACA CTC GGG GGC CTC CTG CAG GAC CAC GGG TCT CGA CTC TTA 

S^r Ala Thr Leu Gly Gly Leu Leu Gin Asp His Gly Ser Arg Leu Leu 

2S0 265 270 

CCT GGA GCC CTC CCC TTT GCA AGG CCC CTC GAC ATG GAG AGG AAG AAG 

Pro Glv Ala Leu Pro Phe Ala Arg Pro Leu Asp Met Glu Arg Lys Lys 

275 280 285 

TCC GAG CAC CCT GAG TCT TCC CTG TCT TCA GAA GAG GAG ACT GCC GGA 
Ser Glu His Pro Glu Ser Ser Leu Ser Ser Glu Glu Glu Thr Ala Gly 

GTG GAG AAC GTC AAG AGC CAG ACG TAT TCC AAA GAT CTG CTG GGG CAG 

Val Glu Asn Val Lys Ser Gin Thr Tyr Ser Lys Asp Leu Leu Gly Gin 
305 310 3 15 

CAG CCG CAT TCG GAG CCC GGG GCC GCT GCG TTT GGG GAG CTC CAA AAC 
Gin Pro Kis Ser Glu Pro Gly Ala Ala Ala Phe Gly Glu Leu Gin Asn 
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CAG ATG CCT GGG CCC TCG /JVJ GAG GAG GAG AGC CTT CCA GCG GGT GC7 

Gin Met Pro Glv P::c: ilo:' Lv^ Gin Glu Gin Ser Leu Fro Ai«'-: Gly Ala 
3 4 0 * ' :^ ! - 350 

CAG GAG GCC CTC AGC GAG GGC CTG CAA TTG GAG GTC CAG CCT TCC GAG 

Gin Glu Ala Leu Ser hsp Gly Leu Gin Leu Glu Val Gin Pro Ser Glu 
355 360 365 

GAA GAG GCG CGG GGC TAC ATC GTG ACA GAC AGA GAC CCC CTG CGC CCC 

Glu Glu Ala Arg Gly Tyr lie Val Thr Asp Arg Asp Pro Leu Arg Pro 
370 375 380 

GAG GAA GGA AGG CGG CTG GTG GAG GAC GTC GCC CGC CTC CTG CAG GTG 

Glu Glu Gly Arq Arg Leu Val Glu Asp Val Ala Arg Leu Leu Glr: Val i 
385 " 390 395 4-.00 

CCC AGC AGC GCG TTC GCT GAC GTG GAG GTT CTC GGA CCA GCA GTG ACC 

Pro Ser Scr Ala Phc Aic= Asp Val Glu Val Leu Gly Pro Ala Vai r .?hr 
4 0 5 4 1 C 4 15 

TTC AAA GTG AGC GCC AAT GTC CAA AAC GTG ACC ACT GAG GAT GTG GAG 

Phe Lys Vai Ser Ala Asn Val Gin Ash Val Thr Thr Glu Asp Val Glu 
420 425 43 0 

AAG GCC ACA GTT GAC AAC AAA GAC AAA CTG GAG GAA ACC TCT GGA CTG 

Lys Ala Thr Val Asp Asn Lys Asp Lys Leu Glu Glu Thr Ser Gly .Lieu 
42b 440 445 

AAA ATT CTT CAA ACC GGA GTC GGG TCG AAA AGC AAA CTC AAG TTC CTG 

Lys lie Leu Gin Thr Gly Val Gly Ser Lys Ser Lys Leu Lys Phe Leu ' 
450 455 460 

CCT CCT CAG GCG GAG CAA GAA GAC TCC ACC AAG TTC ATC GCG CTC ACC 

Pro Pro Gin Ala Glu Gin Glu Asp Ser Thr Lys Phe lie Ala Leu Thr 
465 470 475 480 

CTG GTC TCC CTC GCC TGC ATC CTG GGC GTC CTC CTG GCC TCT GGC CTC 

Leu Val Ser Leu Ala Cys lie Leu Gly Val Leu Leu Ala Ser Gly Leu 
485 490 495 

ATC TAC TGC CTC CGC CAT AGC TCT CAG CAC AGG CTG AAG GAG AAG CTC 

Ii e Tyr Cvs Leu Arg His Ser Ser Gin His Arg Leu Lys Glu Lys Leu 
500 505 510 
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TCG GGA CTA GGG GGC 
Ser Giy Leu Gly Gly 

GAG CTG TGC CGC GAG 

Glu Leu Cys Arg Gin 
530 

GGC CCG CAC ACG TCA 

Gly Pro His Thr Ser 
545 

GGG CCG ATC CCC AGC 

Glv Pro lie Pro Ser 
565 

GAG GAG CCT GTG CAG 

Glu Glu Pro Val Gin 
5 60 

CTG TCC TAC ATG GAG 

Leu Ser Tyr Met Glu 
595 

GAG TGG GAA GCG CTG 

Glu Trp Glu Ala Leu 
610 

GTG GCC CAG AGG GAG 

Val Ala Gin Arg Glu 
625 

CTG ACC TAT GAC CAC 

Leu Thr Tyr Asp His 
645 

AGC CAC TCA GAC TAC 

Ser His Ser Asp Tyr 
660 

AGG AAC CCC GCG TAC 

Arg Asn Pro Ala Tyr 
675 



GAC CCA GGT GCA GAT 

Asd Pro Gly Ala Asp 
52C 

CGT ATG GCC ACG CGG 

Arg Met: Ala Thr Arg 
535 

CGC ATC AGC AGC GTC 

Arg lie Ser Ser Val 
550 

CCC TCC GCA CGC AGC 

Pro Ser Ala Arg Ser 
570 

TCC AAC ATG GAC ATC 

Ser Asn Meu Asp lie 

585 

GAC CAC CTG AAG AAC 

Asp His Leu Lys Asn 
600 

TGC GCC TAC CAG GCG 

Cys Ala Tyr Gin Ala 
615 

GAG AAC GTG CCC AAG 

Glu Asn Val Pro Lys 
630 

TCC CGG GTC CTG CTG 

Ser Arg Val Leu Leu 
650 

ATC AAC GCT AGC CCC 

lie Asn Ala Ser Pro 
665 

ATC GCC ACC CAG GGA 

lie Ala Thr Gin Gly 
680 



GCC ACT GCC GCC TAC CAG 

Ala Thr Ala Ala Tyr Gin 
5 2 5 

CCA CCA GAC CGA CCT GAG 

Pro Pre Asp Arg Pro Glu 
540 

TCA TCC CAG TTC AGC GAC 

Ser Ser Gin Phe Ser Asp 
555 560 

AGC GCC TCA TCC TGG TCC 

Ser Ala Ser Ser Trp Ser 
575 

TCC ACC GGC CAC ATG ATC 

Ser Thr Gly His Met He 
590 

AAG AAC CGG CTG GAG AAG 

Lys Asn Arg Leu Glu Lys 
605 

GAG CCC AAC AGC TCG TTC 

Glu Pro Asn Ser Ser Phe 
620 

AAC CGC TCC CTG GCC GTG 

Asn Arg Ser Leu Ala Val 
635 640 

AAG GCG GAG AAC AGC CAC 

Lys Ala Glu Asn Ser His 
655 

ATC ATG GAT CAC GAC CCG 

He Met Asp His Asp Pre 
670 

CCG CTG CCC GCC ACC GTG 

Pro Leu Pro Ala Thr Val 
665 
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GCT GAC 7TT TGG CA -i ATG GTG TGG GAG AGC GGC TCC GTG GTG ATG GTC 

Ala Aso Phe Trp Gin Met Val Trp Glu Ser Gly Cys Val Val lie Val 
690 * 695 700 

ATG CTG AC A CCC CTC GCG GAG AAC GGC GTC CGG CAG TGC TAC CAC TAG 

Met Leu Thr Pro Leu Ala Glu Asn Gly Val Arg Gin Cys Tyr His Tyr 
705 710 715 720 

TGG CCG GAT GAA GGC TCC AAT CTC TAC CAC ATC TAT GAG GTG AAC CTG 

Trp Pro Asp Glu Gly Ser Asn Leu Tyr His lie Tyr Glu Val Asn Leu 
725 730 735 

GTC TCC GAG CAC ATC TGG TGT GAG GAC TTC CTG GTG AGG AGC TTC TAT 

Val Ser Glu His lie Trp Cys Glu Asp Phe Leu Val Arg Ser Phe Tyr 
740 745 750 

CTG AAG AAC CTG CAG ACC AAC GAG ACG CGC ACC GTG ACG CAG TTC CAC 

Leu Lys Asn Leu Gin Thr Asn Glu Thr Arg Thr Val Thr Gin Phe His 
755 760 765 

TTC CTG ACT TGG TAT GAC CGA GGA GTC CCT TCC TCC TCA AGG TCC CTC 

Phe Leu Ser Trp Tyr Asp Arg Gly Val Pro Ser Ser Ser Arg Ser Leu 
770 775 780 

CTG GAC TTC CGC AGA AAA GTA AAC AAG TGC TAC AGG GGC CGT TCT TGT 

Leu Asp Phe Arg Arg Lys Val Asn Lys Cys Tyr Arg Gly Arg Ser Cys 
785 790 795 800 

CCA ATA ATT GTT CAT TGC AGT GAC GGT GCA GGC CGG AGC GGC ACC TAC 

Pro He He Val His Cys Ser Asp Gly Ala Gly Arg Ser Gly Thr Tyr 
805 810 €15 

GTC CTG ATC GAC ATG GTT CTC AAC AAG ATG GCC AAA GGT GCT AAA GAG 

Val Leu He Asp Met Val Leu Asn Lys Met Ala Lys Gly A] a Lys Glu 
820 825 830 

ATT GAT ATC GCA GCG ACC CTG GAG CAC TTG AGG GAC CAG AGA CCC GGC 

He Asp He Ala Ala Thr Leu Glu His Leu Arg Asp Gin Arg Pro Gly 
835 840 845 

ATG GTG CAG ACG AAG GAG CAG TTT GAG TTC GCG CTG ACA GCC GTG GCT 

Met Val Gin Thr Lys Glu Gin Phe Glu Phe Ala Leu Thr Ala Val Ala 
850 855 860 

GAG GAG GTG AAC GCC ATC CTC AAG GCC CTT CCC CAG TGAGCGGCAG 



BNSDOCtD: <WO 9722694A2> 



WO 97/22694 



PCT/CA96/00867 



Glu Glu Va: As: ; A"!.: • 1-: L'.vu Lys Ala Leu Pro Gl:: 
865 o70 075 

CCTCAGGGOt GTC A* 4 . - : - J GG AG rCCCTACCCC ACGGATCTTG TCAGGAATrA TGATCTGACT 
TTAATTGTGT GTCTTCTAT7 AT AAC TG CAT AGTAATAGGG CCCTTAGCTC TCCCGTAGTC 
AGCGCAGTT7 AGCAGTTAAA AGTGTATTTT TGTTTAATCA AACAATAATA AAG AGAGGTT 
TGTCGGAAAA AT G C AAAAAA AA 

SEQ- ID. Nos. 10 and 11 Complete nucleotide (SEQ ID No- 10) and 
amino acid (SEQ ID NO. 11) sequence of IAR. 

In SEQ ID No. II, the signal peptide is underlined a r» c the 
transmembrane regie:: ir ccubl e - undo r 1 i ncd . An extracellular RGDS 
sequence ".c a potent ia] site of N- linked glycosyla t ion are in 
bold. Th*.-. active site or trie phosphatase is found at arcino acids 
942-954. 

GCCTCCCGCC GCCTCCCCCG CGGCCATGGA CTGAGCGCCG CCGGCCAGGC CGCGGGG 

ATG GGG CCG CCG CTC CCG CTG CTG CTG CTG CTA CTG CTG CTG CTG CCG 
105 



Met Glv Pro Pro Leu Pro Leu Leu Leu Leu Leu Leu Leu Leu Leu Pro 

1 ' " 



10 1 * 



CCA CGC GTC CTG CCT GCC GCC CCT TCG TCC GTC CCC CGC GGC CC 



CAG 



P-o Arc VrJ Leu Pro Ala Ala Pro Ser Ser Val Pro Arg Gly Arg Gin 
20 25 30 

CTC CCG GGG CGT CTG GGC TGC CTG CTC GAG GAG GGC CTC TGC GGA GCG 

Leu Pro Glv Arg Leu Gly Cys Leu Leu Glu Glu Gly Leu Cys Gly Ala 
35 4 0 4 5 

TCC GAG GCC TGT GTG AAC GAT GGA GTG TTT GGA AGG TGC CAG AAG GTT 

Ser Glu Ala Cys Val Asn Asp Gly Val Phe Gly Arg Cys Gin Lys Val 
50 55 60 

CCG GCA'ATG GAC TTT TAC CGC TAC GAG GTG TCG CCC GTG GCC CTG CAG 

Pro Ala Met Aso Phe Tyr Arg Tyr Glu Val Ser Pro Val Ala Leu Gin 
65 \ 70 75 80 
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- 58 - 

CGC CTG CGC GTG GCG TTG CAG AAG CTT TCG GGC AC A GGT TTC ACG TGG 

Arg Leu Arc Val Alu Leu Gin Lys Leu Ser Gly Thr Gly Phe Thr Trp 

P5 90 < J 5 

CAG GAT GAC TAT ACT CAG TAT GTG ATG GAC CAG GAA CTT GCA GAG CTC 

Gin Asr> Asp Tvr Thr Gin Tyr Val Met Asp Gin Glu Leu Ala Asp Leu 
ICO * 105 110 

CCG AAA ACC TAC CTG AGG CGT CCT GAA GCA TCC AGC CCA GCC AGG CCC 

Pro Lys Thr Tyr Leu Arg Arg Pro Glu Ala Ser Ser Pro Ala Arg Pro 
115 ' 120 125 

TCA AAA CAC AGC GTT GGC AGC GAG AGG AGG TAC AGT CGG GAG GGC GGT 

Ser Lys His Ser Val Gly Ser Glu Arg Arc Tyr Ser Arg Glu Gly Gly 
130 135 140 

GCT GCC CTG GCC AAC GCC CTC CGA CGC CAC CTG CCC TTC CTG GAG GCC 

Ala Ala Leu Ala Asn Ala Leu Arq Arg His Leu Pro Phe Le.; Glu Ala 
145 150 155 160 

CTG TCC CAG GCC CC7\ GCC TCA GAC GTG CTC GCC AGG ACC CAT ACG GCG 

-Leu Ser GI" Ala Pro Ala Ser Asp Val Leu Ala Arg Thr His Thr Ala 
165 170 175 

CAG GAC AG A CCC CCC GCT GAG GGT GAT GAC CGC TTC TCC GAG AGC ATC 

Gin Asp Arg Pro Pro Ala Glu Gly Asp Asp Arg Phe Ser Glu Ser 1I«= 
180 185 190 

CTG ACC TAT GTG GCC CAC ACG TCT GCG CTG ACC TAC CCT CCC GGG TCC 

Leu Thr Tyr Val Ala His Thr Ser Ala Leu Thr Tyr Pro Pre Gly Ser 
195 200 205 

CGG ACC CAG CTC CGC GAG GAC CTC CTG CCG CGG ACC CTC GGC CAG CTC 

Arg Trr Gin Leu Arg Glu Asp Leu Leu Pre Arg Thr Leu G:.y Gin Leu 
210 215 220 

CAG CCA GAT GAG CTC AGC CCT AAG GTG GAC AGT GGT GTG GAC AGA CAC 

Glr Pro Asd Glu Leu Ser Pro Lys Val Asp Ser Gly Val Asp Arg Hi. 5 
225 " 230 235 24 0 

CAT CTG ATG GCG GCC CTC AGT GCC TAT GCT GCC CAG AGG CCC CCA GCT 

H^s Le- Met Ala Ala Leu Ser Ala Tyr Ala Ala Gin Arg Pro Pro Ala 
245 250 255 
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CCC CCC GGG GAG GGC AGC CTG GAG CCA CAG TAC CTT CTG CGT GCA CCC 

Pro Pre Glv Giu Gly Ser Leu Glu Pre Gin Tyr Leu Leu Arc Ala Pre 

260 265 270 

TCA AG A ATG CCC AGG CCT TTC CTG GCA CCA GGC GCC CCC CAG AAG TGG 

Ser Arq Me: Pre Arg Pro Leu Leu Ala Pro Ala Ala Pro Gin Lys Trp 

27b " 2B0 285 

CCT TCA CCT CTG GGA GAT TCC GAA GAC CCC TCC AGC ACA GGC GAT GGA 

Pro Ser Pre Leu Gly Asp Ser Glu Asp Pro Ser Ser Thr Gly Asp Gly 

290 295 300 

GCA CGG ATT CAT ACC CTC CTG AAG GAC CTG CAG AGG CAG CCG GCT GAG 

Ala Arg lie His Thr Leu Leu Lys Asp Leu Gin Arg Gin Pro Ala Glu 

305 310 315 320 

GTG AGG GGC CTG AGT GGC CTG GAG CTG GAC GGC ATG GCT GAG CTG ATG 

Val Aro Glv Leu Ser Glv Leu Glu Leu Asc Gly Met Ala Giu Leu Met. 

325 * 330 " 335 

GCT GGC CTG ATG CAA GGC GTG GAC CAT GGA GTA GCT CGA GGC AGC CCT 

Ala Glv Leu Me: Gin Gly Val Asp His Gly Val Ala Arg Gly Ser Pro 

340 345 350 

GGG AG A GCG GCC CTG GGA GAG TCT GGA GAA CAG GCG GAT GGC CCC AAG 

Gly Arg Ala Ala Leu Gly Glu Ser Gly Glu Gin Ala Asp Gly Pro Lys 

355 360 365 

GCC ACC CTC CGT GGA GAC AGC TTT CCA GAT GAC GGA GTG CAG GAC GAC 

Ala Thr Leu Arg Gly Asp Ser Phe Pro Asp Asp Gly Val Gin Asp Asp 

370 375 * "380 

GAT GAT AG A CTT TAC CAA GAG GTC CAT CGT CTG AGT GCC ACA CTC GGG 

Asp Asp Arg Leu Tyr Gin Glu Val His Arg Leu Ser Ala Thr Leu Gly 

385 390 395 400 

GGC CTC CTG CAG GAC CAC GGG TCT CGA CTC TTA CCT GGA GCC CTC CCC 

Gly Leu Leu Gin Asp His Gly Ser Arg Leu Leu Pro Gly Ala Leu Pro 

405 410 415 

TTT GCA AGG CCC CTC GAC ATG GAG AGG AAG AAG TCC GAG CAC CCT GAG 

Phe Ala Arg Pro Leu Asp Met Glu Arg Lys Lys Ser Glu His Pro Glu 

420 425 430 

TCT TCC CTG TCT TCA GAA GAG GAG ACT GCC GGA GTG GAG AAC GTC AAG 
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Ser Ser Leu Scr Ser Glu GIu Glu Thr Ala Gly Val Glu Arsr- Val Lys 

4 3 5 4 4 0 4 4 5 

AGC CAG ACG TAT TCC AAA GAT CTG CTG GGG GAG CAG CCG CAT TCG GA3 

Ser Gin Thr Tyr Ser Lys Assp Leu Leu Gjy Gin Gin Pro His Ser Glu 

450 455 * " 460 

CCC GGG G'CC GCT GCG TTT GGG GAG CTC CAA AAC CAG ATG CCT GGG CCC 

Pro Gly Ala Ala Ala Phe Gly Glu Leu Gin Asn Gin Met Pre Gly Pro 

465 470 475 460 

TCG AAG GAG GAG CAG AGC CTT CCA GCG GGT GCT CAG GAG GCC CTC AGC 

Ser Lys Glu Glu Gin Ser Leu Pro Ala Gly Ala Gin Glu Ala Leu Ser 

485 490 495 

GAC GGC CTG CAA TTG GAG GTC CAG CCT TCC GAG GAA GAG GCG CGG GGC 

Asp Gly Leu Gin Leu Glu Val Gin Pro Ser Glu Glu Glu Ala Arg Gly 

500 505 510 

TAC ATC GTG ACA GAC AG A GAC CCC CTG CGC CCC GAG GAA GGA AGG CGG 

Tyr lie Val Thr Asp Arg Asp Pro Leu Arg Pro Glu Glu Gly Arg Ar^ 

515 " 520 525 

CTG GTG GAG GAC GTC GCC CGC CTC CTG CAG GTG CCC AGC AGC GCG TTC 

Leu Val Glu Asp Val Ala Arc Leu Leu Gin Val Pro Ser Ser Ala Phe 

530 535 " 540 

GCT GAC GTG GAG GTT CTC GGA CCA GCA GTG ACC TTC AAA GTG AGC GCC 

Ala Asp Val Glu Val Leu Gly Pro Ala Val Thr Phe Lys Val Ser Ala 

545 550 555 560 

AAT GTC CAA AAC GTG ACC ACT GAG GAT GTG GAG AAG GCC ACA GTT GAC 

Asn Val Gin Asn Val Thr Thr Glu Asp Val Glu Lys Ala Thr Val Asp 

565 570 575 

AAC AAA GAC AAA CTG GAG GAA ACC TCT GGA CTG AAA ATT CTT CAA ACC 

Asn Lys Asp Lys Leu Glu Glu Thr Ser Gly Leu Lys lie Leu Gin Thr 

580 585 590 

GGA GTC GGG TCG AAA AGC AAA CTC AAG TTC CTG CCT CCT CAG GCG GAG 

Gly Val Gly Ser Lys Ser Lys Leu Lys Phe Leu Pro Pro Gin Ala Glu 

595 600 605 

CAA GAA GAC TCC ACC AAG TTC ATC GCG CTC ACC CTG GTC TCC CTC GCC 
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Gin G-'u Ar-p Ser Thr Lvs Ph^ Tl^ Ala Leu Thr i.vu Val Ser Leu JAr, 
610 615 620 

TGC ATC CTC, GGC GTC CTC CTG GCC TCT GGC CTC ATC TAG TGC CTC CGC 




CAT AGC TCT CAG CAC AGG CTG AAG GAG AAG CTC TCG GGA CTA GGG GGC 

His Ser Se- Gin His Arg Leu Lys Glu Lys Leu Ser Gly Leu Gly Gly 
6 45 650 655 

GAC CCA GGT GCA GAT GCC ACT GCC GCC TAC CAG GAG CTG TGC CGC CAG 

Asp Pro Glv Ala Asp Ala Thr Ala Ala Tyr Gin Glu Leu Cys Arg Gin 
660 665 670 

CGT ATG GCC ACG CGG CCA CCA GAC CGA CCT GAG GGC CCG CAC ACG TCA 

A>"Q M*C Ala Thr Arq Pro Pre Asp Arc' Pro Glu Gly Pro His Thr Ser 
67 c 660 . 665 

CGC ATC AGC AGC GTC TCA TCC CAG TTC AGC GAC GGG CCG ATC CCC AGC 

A-a Ii= Se- Ser Val Ser Ser Gin Phe Ser Asp Gly Pro lie Pro Ser 
ilo " 6S5 70C 

CCC TCC GCA CGC AGC AGC GCC TCA TCC TGC- TCC GAG GAG CCT GTG CAG 

Pro Se- Ala Arq Ser Ser Ala Ser Ser Tr? Ser Glu Glu Pro Val Gin 
705 " 710 715 7^0 

TCC AAC ATG GAC ATC TCC ACC GGC CAC ATG ATC CTG TCC TAC ATG GAG 

Ser As- M-t Aso He Ser Thr Gly His Met lie Leu Ser Tyr Met: Glu 
7 - 25 73C 73d 

GAC CAC CTG AAG AAC AAG AAC CGG CTG GAG AAG GAG TGG GAA GCG CTG 

Asp His Leu Lvs Asn Lys Asn Arg Leu Glu Lys Glu Trp Glu Ala Leu 
740 ~ . 745 750 

TGC GCC TAC CAG GCG GAG CCC AAC AGC TCG TTC GTG GCC CAG AGG GAG 

Cvs Ala Gin Ala Glu Pro Asn Ser Ser Phe Val Ala Gin Arg Glu 

755 ' 760 765 

GAG AAC GTG CCC AAG AAC CGC TCC CTG GCC GTG CTG ACC TAT GAC CAC 

Glu As" Va- Pro Lys Asn Arg Ser Leu Ala Val Leu Thr Tyr Asp His 
770 " 775 780 

TCC CGG GTC CTG CTG AAG GCG GAG AAC AGC CAC AGC CAC TCA GAC TAC 
Ser Arc Val Leu Leu Lys Ala Glu Asn Ser His Ser His Ser Asp Tyr 
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785 790 795 SOC 

ATC AAC GCT AGC CCC ATC ATG GAT CAC GAC CCG AGG AAC CC7 GCG TAC 

lie Asn Ala Ser Pro He Met Asp His Asp Pro Arc: Apr. Pro Aio Tyr 
80S 810 ^ " ; ; 

ATC GCC ACC CAG GGA CCG CTG CCC GCC ACC GTG GCT GAC TTT TGG CAC 

lie Ala Thr Gin Gly Pro Leu Pro Ala Thr Val Ala Asp Phe Trp Glr 
820 825 830 

ATG GTG TGG GAG AGC GGC TGC GTG GTG ATC GTC ATG CTG ACA CCC CTC 

Met Val Trp Glu Ser Gly Cys Val Val He Val Met Leu Thr Pro Leu 
835 840 845 

GCG GAG AAC GGC GTC CGG CAG TGC TAC CAC TAC TGG CCG GAT GAA GGC 

Ala Glu Asn Gly Val Arg Gin Cys Tyr His -Tyr Tro Pro Asp Glu Glv 
850 85 5 66 0 

TCC AAT CTC TAC CAC ATC T/vT GAG GTG AAC CTG GTC TCC G A 1- CAC ATC 

Ser Asn Leu Tyr His He Tyr Glu Val Asn Leu Val Ser Glu His He 
865 870 875 880 

TGG TGT GAG GAC TTC CTG GTG AGG AGC TTC TAT CTG AAG AAC CTG CAG 

Trp Cys Glu Asp Phe Leu Val Arg Ser Phe Tyr Leu Lys Asn Leu Gin 
885 " 890 * 895 

ACC AAC GAG ACG CGC ACC GTG ACG CAG TTC CAC TTC CTG AG I TGG TAT 

Thr Asn Glu Thr Arg Thr Val Thr Gin Phe His Phe Leu Ser Trp Tyr 
900 905 910 

GAC CGA GGA GTC CCT TCC TCC TCA AGG TCC CTC CTG GAC TTC CGC AG A 

Asp Arg Gly Val Pro Ser Ser Ser Arg Ser Leu Leu Asp Phe Arg Arg 
915 920 S25 

AAA GTA AAC AAG TGC TAC AGG GGC CGT TCT TGT CCA ATA ATT GTT CAT 

Lys Val Asn Lys Cys Tyr Arg Gly Arg Ser Cys Pro lie lie Val His 
930 935 940 

TGC AGT GAC GGT GCA GGC CGG AGC GGC ACC TAC GTC CTG ATC GAC ATG 

Cys Ser Asp Gly Ala Gly Arg Ser Gly Thr Tyr Val Leu He Asp Met: 
945 950 955 960 

GTT CTC AAC AAG ATG GCC AAA GGT GCT AAA GAG ATT GAT ATC GCA GCC 

Val Leu Asn Lys Met Ala Lys Gly Ala Lys Glu He Asp lie Ala Ala 
965 970 * 975 
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ACC CTG GAG CAC TTG AGG GAC CAG AGA CCC GGC ATG GTC CAG ACQ AAG 



Th>- Lou Glu His Leu Arg Asp Gin Arg Pre Gly Met Vai Gin Thr Lys 

980 985 990 

GAG CAG TTT GAG TTC GCG CTG AC A GCC GTG GCT GAG GAG GTC AAC GCC 

Glu Gin Phe Glu Phe Ala Leu Thr Ala Val Ala Glu Glu Val Asn Ala 
995 1000 1005 

ATC CTC AAG GCC CTT CCC CAG T GAGCGGCAGC CTCAGGGGCC TCAGGGGAGC 

lie Leu Lys Ala Leu Pro Gin 
1010 1° 15 

CCCTACCCCA CGGATGTTGT CAGGAATCAT GATCTGACTT TAATTGTGTG TCTTCTATTA 
TAACTGCATA GTAATAGGGC CCTTAGCTCT CCCGTAGTCA GCGCAGTTTA GCAGTTAAAA. 
GTGTATTTTT GTTTAATCAA ACAATAATAA AG AG AG GTT T GTCGGAAAAA TCCAAAA 



SEQ ID Nos 12 and 13: Nucleotide (SEQ ID No. 12) and amino acid 
(SEQ ID NO. 13) sequences of the cytoplasmic (intracellular) 
domain of IAR. SEQ ID NO. 12 corresponds to the nucleotides of 
SEQ ID No. 10 which encode amino acids 646 to 1015 of SEQ ID' No . 
11. SEQ ID No. 13 corresponds to amino acids 646 to 1015 of SEQ 
ID No . 11 • 

AGG CTG AAG GAG AAG CTC TCG GGA CTA GGG GGC 

Arg Leu Lys Glu Lys Leu Ser Gly Leu Gly Gly 

650 655 

GAC CCA GGT GCA GAT GCC ACT GCC GCC TAG CAG GAG CTG TGC CGC CAG 

Asp Pro Gly Ala Asp Ala Thr Ala Ala Tyr Gin Glu Leu Cys Arg Gin 
660. 665 ,. 670 

CGT ATG GCC ACG CGG CCA CCA GAC CGA CCT GAG GGC CCG CAC ACG TCA 



Arc Met Ala Thr Arg Pro Pro Asp Arg Pro Glu Gly Pro His Thr Ser 
675 680 685 

CGC ATC AGC AGC GTC TCA TCC CAG TTC AGC GAC GGG CCG ATC CCC AGC 
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Arc; lie Ser Ser 

690 

CCC TCC GCA CGC 

Pro Ser Ala Arc 
705 

TCC AAC ATG GAC 

Ser Asn Met Asp 

GAC CAC CTG AAG 

Asp His Leu Lys 
740 

TGC GCC TAG CAG 

Cys AK-. Tyr Gin 
755 

GAG AAC GTG CCC 

Glu Asn Val Pro 
770 

TCC CGG GTC CTG 

Ser Arg Val Leu 
765 

ATC AAC GCT AGC 

lie Asn Ala Ser 

ATC GCC ACC CAG 

He Ala Thr Gin 
82C 

ATG GTG TGG GAG 

Met Val Trp Glu 
835 

GCG GAG AAC GGC 

Ala Glu Asn Gly 
850 

TCC AAT CTC TAC 



Val Ser Ser Gin 
695 

AGC AGC GCC TCA 

Ser Ser Ala Ser 
710 

ATC TCC ACC GGC 

He Ser Thr Gly 
725 

AAC AAG AAC CGG 
Asn Lys Asn Arg 

GCG GAG CCC AAC 

Ala Glu Pro Asn 
760 

AAG AAC CGC TCC 

Lys Asn Arg Ser 
775 

CTG AAG GCG GAG 

Leu Lys Ala Glu 
790 

CCC ATC ATG GAT 

Pro He Met Asp 
805 

GGA CCG CTG CCC 
Gly Pro Leu Pro 

AGC GGC TGC GTG 

Ser Gly Cys Val 
840 

GTC CGG CAG TGC 

Val A.rg Gin Cys 
855 

CAC ATC TAT GAG 



Phe Ser Asp Gly 
700 

TCC TGG TCC GAG 

Ser Trp Ser Glu 
715 

CAC ATG ATC CTG 

His Met He Leu 
730 

CTG GAG AAG GAG 

Leu Glu Lys Glu 
745 

AGC TCG TTC GTG 

Ser Ser Phe Val 

CTG GCC GTG CTG 

Leu Ala Val Leu 
780 

AAC AGC CAC AGC 

Asn Ser His Ser 
795 

CAC GAC CCG AGG 

His Asp Pro Arg 
810 

GCC ACC GTG GCT 

Ala Thr Val Ala 
825 

GTG ATC GTC ATG 
Val lie Val Met 

TAC CAC TAC ^GG 

Tyr His Tyr _ rp 
860 

GTG AAC CTG GTC 



Pro II 2 Pro Ser 

GAG CCT GTG CAG 

Glu Pro Val Gin 
720 

TCC TAC ATG GAG 

Ser Tyr Met Glu 
735 

TGG GAA GCG CTG 

Trp Glu Ala Leu 
750 

GCC CAG - AGG GAG 

Ala Gin Arg Glu 
765 

ACC TAT GAC CAC 

Thr Tyr Asp His 

CAC TCA GAC TAC 

His Ser Asp Tyr 
800 

AAC CCC GCG TAC 

Asn Pro Ala Tyx 
815 

GAC TTT TGG CAG 

Asp Phe Trp Glr: 
830 

CTG ACA CCC CTC: 

Leu Thr Pro Leu 
845 

CCG GAT GAA GGC 
Pro A.sp Glu Glv 

TCC GAG CAC ATC 
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^e^- Al-ii Leu Tv: His lie Tvr Glu Val Asn Lieu Val Ser Glu His lie 

^ * 67G * 675 88C 

TGG TC-7 GAG GAC TTC CTG GTG AGG AGC TTG TAT CTG AAG AAC CTG CAG 

Tvr . <* Vf - --..J ^c:.. pv v: . i je ,_ : y a x Arc £ei Phe: Tyr Leu I..yr;. Asn Leu Gin 

305 " 890 ' * 895 

ACC AAC GAG ACG GGC ACC GTG ACG CAG TTC CAC TTC CTG AGT TGG TAT 

Thr Asn Glu Thr Arg Thr Val Thr Gin Phe His Phe Leu Ser Trp Tyr 

900 905 910 

GAC CGA GGA GTC CCT TCC TCC TCA AGG TCC CTC CTG GAC TTC CGC AGA 

Asp Arg Gly Val Pro Ser Ser Ser Arg Ser Leu Leu Asp Phe Arg Arg 

915 920 925 

AAA GTA AAC AAG TGC TAC AGG GGC CGT TCT TGT CCA ATA ATT GTT CAT 

Lvr- V\-.l Ann Lvs Cvs Tyr Arq Gly Arg Scr Cys Pro Ilo lie Val His 

930 " * 935 ' " " 940 

TGC AGT GAC GGT GCA GGC CGG AGC GGC ACC TAC GTC CTG ATC GAC ATG 

Cys Ser Asp GIv Ala Gly Arg Ser Gly Thr Tyr Val Leu lie Asp Met 

945 ' 950 955 960 

GTT CTC AAC AAG ATG GCC AAA GGT GCT AAA GAG ATT GAT ATC GCA GCG 

Vai Leu Asn Lys Mec Ala Lys Gly Ala Lys Glu lie Asp lie Ala Ala 

965 97C 975 

ACC CTG GAG CAC TTG AGG GAC CAG AGA CCC GGC ATG GTC CAG ACG AAG 

Thr Leu Glu His Leu Arg Asp Gin Arg Pre Gly Met Val Gin Thr Lys 

980 985 990 

GAG CAG TTT GAG TTC GCG CTG ACA GCC GTG GCT GAG GAG GTG AAC GCC 

Glu Gin Phe Glu Phe Ala Leu Thr Ala Val Ala Glu Glu Val Asn Ala 

995 1000 1005 

ATC CTC AAG GCC CTT CCC CAG 

lie Leu Lys Ala Leu Pro Gin 

101C 1015 
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CLAIMS 

1. a nucleic acid encoding a polypeptide that has the 
properties of an Insulin-dependent diabetes mellitts 

( IDDM) -associated auroantigen, which nucleic acid 
comprises : 

(a) the coding sequence of SEQ. ID. No. I, 10 or 12, 
and/or a sequence complementary thereto; 

(b) a sequence which hybridises to a sequence defined in 
(a); 

(c) a sequence that is degenerate as a result of the 
genetic code to a nucleic acid sequence defined in 
(a) or (b) ; 

(d) a sequence having at least 80% homology to a 
sequence 

defined in (a) , (b) or (c) . 

2. A nucleic acid according to claim 1 comprising the 
coding sequence of SEQ . ID. No. 1, 10 or 12. 

3. a polypeptide encoded by a nucleic acid according to 
claim I or 2 . 

4. a polypeptide according to claim 3 which comprises 
the amino acid sequence of SEQ . ID. No. 2, 11 or 12; or one 
or more parts thereof . 

5. a nucleic acid vector including a nucleic acid as 
defined in claim 1 or 2. 

6. A cell harbouring a vector according to claim 5 or a 
nucleic acid according to claim 1 or 2 . 

7. A method of producing a polypeptide according to 
claim 3 or 4 which comprises culturing a cell according to 
claim 6 under conditions which permit the expression of the 
polypeptide; and recovering the polypeptide. 

8. A method of diagnosing IDDM which comprises: 

(a) contacting a sample from an individual with a 
polypeptide according to claim 3 or 4 under 
conditions that permit the recognition of the 
polypeptide by autoantibodies that recognise IAR; 
and 

(b) determining whether or not said recognition occurs. 
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cj . A method according :c claim £ which further 

com pr i s e : 

(c) contacting a sample from the individual with at 
ieasL cne further IDDM-asscciated autoantigen, or a 
polypeptide that functions as an i DDM -associated 
antigen, under conditions which permit recognition 
cf the autoantigen or polypeptide by autoantibodies ; 
and 

(d) determining whether or not said recognition occurs. 

10. A method according to claim 9 wherein a further 
autoantigen or polypeptide is hIA-2, GAD, ICA-512, ICA-12, 
ICA-13. ICA-208, ICA-302, ICA-313, ICA-525 or ICA-505; or a 
polypeptide that functions as one ct these further auto- 
autoant igens . 

11. A method according to claim fa, 9 or 10 wherein the 
sample is taken from a patient who is a first-degree 
relation of a known I DDM sufferer. 

12 . Use of a polypeptide according to claim 3 or 4 in 
diagnosing I DDM . 

13. A diagnostic. test kit for diagnosing IDDM which 
comprises a polypeptide according tc claim 3 or 4 , and 
means for determining whether or not the polypeptide is 
recognised by autoantibodies that recognise IAR . 

14. A kit according to claim 13 which further comprises 
at least one further IDDM-associated autoantigen. 

15. A kit according to claim 14 which further comprises 
means for determining whether or not the further 
autoantigen is recognised by autoantibodies. 

16. A kit according to claim 14 or 15 wherein a further 
autoantigen or polypeptide is hIA-2, GAD, ICA-512, ICA-12, 
ICA-13, ICA-208, ICA-302, ICA-313, ICA-525 or ICA-505; or a 
polypeptide that functions as one of these further auto- 
autoantigens . 
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